Measuring the retrievability of digital library content using analytics data

Published:2024-03-19
ISSN:2330-1635
Container-title:Journal of the Association for Information Science and Technology
language:en
Short-container-title:Asso for Info Science & Tech

Author:

Jahani Hamed¹, Azzopardi Leif², Sanderson Mark³

Affiliation:

1. School of Accounting, Information Systems and Supply Chain RMIT University Melbourne Victoria Australia

2. University of Strathclyde Glasgow UK

3. School of Computing Technologies RMIT University Melbourne Victoria Australia

Abstract

Digital libraries aim to provide value to users by housing content that is accessible and searchable. Often such access is afforded through external web search engines. In this article, we measure how easily digital library content can be retrieved (i.e., how retrievable it is) through a well‐known search engine (Google) using its analytics platforms. Using two measures of document retrievability, we contrast our results with simulation‐based studies that employed synthetic query sets. We determine that estimating the retrievability of content given a Digital Library index is not a strong predictor of how retrievable the content is in practice (via external search engines). Retrievability established the notion that search algorithms can be biased. In our work, we find that while such bias is present, much of the variation in retrievability appears to be strongly influenced by the queries submitted to the library, a side of retrievability less examined in past work.
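
The abstract does not name its two retrievability measures. As an illustration only, the sketch below assumes the two measures common in the retrievability literature: a cumulative score (each appearance within a rank cutoff counts 1) and a gravity-based score (each appearance counts 1 / rank^beta). It also computes a Gini coefficient as one possible summary of how unevenly retrievability is spread. The function names, the cutoff of 10, and the per-query `weights` (for example, how often a query was issued according to analytics data) are hypothetical choices, not the authors' implementation.

```python
from collections import defaultdict


def retrievability(rankings, cutoff=10, beta=None, weights=None):
    """Score each document by how often (and how highly) it is retrieved.

    rankings: dict mapping query -> ordered list of document ids (rank 1 first).
    cutoff:   only ranks <= cutoff contribute (assumed default, not from the paper).
    beta:     None selects the cumulative measure (each hit counts 1);
              a number selects the gravity-based measure (each hit counts 1 / rank**beta).
    weights:  optional dict of query -> weight; queries not listed default to 1.0.
    """
    scores = defaultdict(float)
    for query, ranked_docs in rankings.items():
        weight = 1.0 if weights is None else weights.get(query, 1.0)
        for rank, doc in enumerate(ranked_docs[:cutoff], start=1):
            utility = 1.0 if beta is None else 1.0 / rank ** beta
            scores[doc] += weight * utility
    return dict(scores)


def gini(values):
    """Gini coefficient of non-negative scores (0 means perfectly equal)."""
    xs = sorted(values)
    n, total = len(xs), sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum(i * x for i, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n


# Toy data: three queries and the documents each one returns, best first.
toy_rankings = {"q1": ["a", "b", "c"], "q2": ["a", "c"], "q3": ["b", "a"]}
cumulative = retrievability(toy_rankings, cutoff=2)
gravity = retrievability(toy_rankings, cutoff=2, beta=1.0)
inequality = gini(cumulative.values())
```

Documents that are never retrieved do not appear in the returned dictionary, so a Gini coefficient over a whole collection would need those documents added with a score of 0 first.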

Publisher

Wiley

Reference: 37 articles.

1. Abolghasemi, A., Verberne, S., Askari, A., & Azzopardi, L. (2023). Retrievability bias estimation using synthetically generated queries. In The first workshop on generative information retrieval (GenIR@SIGIR23).

2. Alaofi, M., Gallagher, L., Sanderson, M., Scholer, F., & Thomas, P. (2023). Can generative LLMs create query variants for test collections? An exploratory study. In Proceedings of the 46th international ACM SIGIR conference on research and development in information retrieval (pp. 1869–1873).

3. Azzopardi, L., & Bache, R. (2010). On the relationship between effectiveness and accessibility. In Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval (pp. 889–890).

4. Azzopardi, L., & Vinay, V. (2008a). Accessibility in information retrieval. In European conference on information retrieval (pp. 482–489). Springer.

5. Azzopardi, L., & Vinay, V. (2008b). Document accessibility: Evaluating the access afforded to a document by the retrieval system. In Workshop on novel methodologies for evaluation in information retrieval (pp. 52–60). Citeseer.