Electronic Library, Volume (36), No (3), Year (2018-5) , Pages (430-444)

Title : ( Domain-specific readability measures to improve information retrieval in the Persian language )

Authors: Sholeh Arastoopoor ,

Access to full-text not allowed by authors

Citation: BibTeX | EndNote

Abstract

Purpose – The degree to which a text is considered readable depends on the capability of the reader. This assumption puts different information retrieval systems at the risk of retrieving unreadable or hard-to-be-read yet relevant documents for their users. This paper aims to examine the potential use of concept-based readability measures along with classic measures for re-ranking search results in information retrieval systems, specifically in the Persian language. Design/methodology/approach – Flesch–Dayani as a classic readability measure along with document scope (DS) and document cohesion (DC) as domain-specific measures have been applied for scoring the retrieved documents from Google (181 documents) and the RICeST database (215 documents) in the field of computer science and information technology (IT). The re-ranked result has been compared with the ranking of potential users regarding their readability. Findings – The results show that there is a difference among subcategories of the computer science and IT field according to their readability and understandability. This study also shows that it is possible to develop a hybrid score based on DS and DC measures and, among all four applied scores in re-ranking the documents, the re-ranked list of documents based on the DSDC score shows correlation with re-ranking of the participants in both groups. Practical implications – The findings of this study would foster a new option in re-ranking search results based on their difficulty for experts and non-experts in different fields. Originality/value – The findings and the two-mode re-ranking model proposed in this paper along with its primary focus on domain-specific readability in the Persian language would help Web search engines and online databases in further refining the search results in pursuit of retrieving useful texts for users with differing expertise.

Keywords

, Information retrieval, Document cohesion, Document scope, Flesch–Dayani formula, Persian, Re-ranking search results, Readability scores
برای دانلود از شناسه و رمز عبور پرتال پویا استفاده کنید.

@article{paperid:1070636,
author = {Arastoopoor, Sholeh},
title = {Domain-specific readability measures to improve information retrieval in the Persian language},
journal = {Electronic Library},
year = {2018},
volume = {36},
number = {3},
month = {May},
issn = {0264-0473},
pages = {430--444},
numpages = {14},
keywords = {Information retrieval; Document cohesion; Document scope; Flesch–Dayani formula; Persian; Re-ranking search results; Readability scores},
}

[Download]

%0 Journal Article
%T Domain-specific readability measures to improve information retrieval in the Persian language
%A Arastoopoor, Sholeh
%J Electronic Library
%@ 0264-0473
%D 2018

[Download]