Abstract
Purpose
– The purpose of this paper is to highlight the retrieval effectiveness of search engines taking into consideration both precision and relative recall.
Design/methodology/approach
– The study is based on search engines that are selected on the basis of Alexa (Actionable Analytics for the web) Rank. Alexa listed top 500 sites, namely, search engines, portals, directories, social networking sites, networking tools, etc. But the scope of study is confined to only general search engines on the basis of language which was confined to English. Therefore only two general search engines are selected for the study . Alexa reports Google.com as the most visited website worldwide and Yahoo.com as the fourth most visited website globally. A total of 15 queries were selected randomly from PG students of Department of Library and Information Science during a period of eight days (from May 8 to May 15, 2014) which are classified manually into navigational, informational and transactional queries. However, queries are largely distributed on the two selected search engines to check their retrieval effectiveness as a training data set in order to define some characteristics of each type. Each query was submitted to the selected search engines which retrieved a large number of results but only the first 30 results were evaluated to limit the study in view of the fact that most of the users usually look up under the first hits of a query.
Findings
– The study estimated the precision and relative recall of Google and Yahoo. Queries using concepts in the field of Library and Information Science were tested and were divided into navigational queries, informational queries and transactional queries. Results of the study showed that the mean precision of Google was high with (1.10) followed by Yahoo with (0.88). While as, mean relative recall of Google was high with (0.68) followed by Yahoo with (0.31), respectively.
Research limitations/implications
– The study highlights the retrieval effectiveness of only two search engines.
Originality/value
– The research work is authentic and does not contain any plagiarized work.
Subject
Library and Information Sciences,Computer Science Applications,Information Systems
Reference38 articles.
1. Ashkan, A.
,
Clarke, C.L.A.
,
Agichtein, E.
and
Guo, Q.
(2008), “Characterizing query intent from sponsored search clickthrough data”, SIGIR-IRA, available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.155.3370
&
rep=rep1
&
type=pdf (accessed July 16, 2015).
2. Bar-Ilan, J.
(2007), “Methods for measuring search engine performance over time”, available at: wwwconference.org/www10/cdrom/posters/1018.pdf (accessed July 18, 2015).
3. Barr, C.
,
Jones, R.
and
Regelson, M.
(2008), “The linguistic structure of English web-search queries”, Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 1021-1030, available at: https://aclweb.org/anthology/D/D08/D08-1107.pdf (accessed July 14, 2015).
4. Bian, J.
,
Liu, T.Y.
,
Qin, T.
and
Zha, H.
(2010), “Ranking with query-dependent loss for web search”, available at: http://research.microsoft.com/en-us/people/tyliu/wsdm10.pdf (accessed July 21, 2015).
5. Bitirim, Y.
,
Tonta, Y.
and
Sever, H.
(2002), “Information retrieval effectiveness of Turkish search engines”,
Advances in Information Systems
, Vol. 2457 No. 2002, pp. 93-103, doi: 10.1007/3-540-36077-8_9.
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献