Information Retrieval in an Infodemic: The Case of COVID-19 Publications

Published: 2021-09-17 Issue: 9 Volume: 23 Page: e30161
ISSN: 1438-8871
Container-title: Journal of Medical Internet Research
Language: en
Short-container-title: J Med Internet Res

Author:

Teodoro Douglas, Ferdowsi Sohrab, Borissov Nikolay, Kashani Elham, Vicente Alvarez David, Copara Jenny, Gouareb Racha, Naderi Nona, Amini Poorya

Abstract

Background: The COVID-19 global health crisis has led to an exponential surge in published scientific literature. In an attempt to tackle the pandemic, extremely large COVID-19–related corpora are being created, sometimes containing inaccurate information, at a scale that is beyond human analysis.

Objective: In the context of searching for scientific evidence in the deluge of COVID-19–related literature, we present an information retrieval methodology for effective identification of relevant sources to answer biomedical queries posed using natural language.

Methods: Our multistage retrieval methodology combines probabilistic weighting models and reranking algorithms based on deep neural architectures to boost the ranking of relevant documents. The similarity between COVID-19 queries and documents is computed, and a series of postprocessing methods is applied to the initial ranked list to improve the match between the query and the biomedical information source and to boost the position of relevant documents.

Results: The methodology was evaluated in the context of the TREC-COVID challenge, achieving results competitive with those of the top-ranking teams participating in the competition. In particular, the combination of bag-of-words and deep neural language models significantly outperformed an Okapi Best Match 25–based baseline, retrieving, on average, 83% of relevant documents in the top 20.

Conclusions: These results indicate that multistage retrieval supported by deep learning could enhance identification of literature for COVID-19–related questions posed using natural language.
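The multistage idea described in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy documents, the `k1` and `b` defaults, and the `first_stage_k` parameter are all assumptions made for illustration, and the token-overlap `placeholder_reranker` merely stands in for the deep neural reranker mentioned in the abstract. The first stage scores documents with an Okapi BM25 weighting, and the second stage reorders only the top candidates.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer; a real system would use proper text processing.
    return text.lower().split()


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """First stage: Okapi BM25 score of each document for the query."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(toks) for toks in tokenized) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def placeholder_reranker(query, doc):
    # Hypothetical stand-in for a neural relevance model: Jaccard token overlap.
    q, d = set(tokenize(query)), set(tokenize(doc))
    return len(q & d) / len(q | d) if q | d else 0.0


def multistage_search(query, docs, first_stage_k=3, reranker=placeholder_reranker):
    """Return document indices: BM25 candidates, reordered by the reranker."""
    scores = bm25_scores(query, docs)
    candidates = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
    candidates = candidates[:first_stage_k]
    return sorted(candidates, key=lambda i: reranker(query, docs[i]), reverse=True)


if __name__ == "__main__":
    toy_docs = [
        "coronavirus transmission in hospitals",
        "effect of masks on coronavirus transmission",
        "history of the bicycle",
        "vaccine trials for influenza",
    ]
    for idx in multistage_search("masks coronavirus transmission", toy_docs):
        print(idx, toy_docs[idx])
```

Restricting the second stage to a short candidate list is the usual motivation for this design: an expensive reranker only has to score a handful of documents instead of the whole corpus.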

Publisher

JMIR Publications Inc.

Subject

Health Informatics

References: 68 articles.

1. The scientific literature on Coronaviruses, COVID-19 and its associated safety-related research dimensions: A scientometric analysis and scoping review

2. Framework for Managing the COVID-19 Infodemic: Methods and Results of an Online, Crowdsourced WHO Technical Consultation

3. How to Fight an Infodemic: The Four Pillars of Infodemic Management

4. Infodemiology: the epidemiology of (mis)information

5. Exploring the use of web searches for risk communication during COVID-19 in Germany

Cited by 8 articles.

1. Did high frequency phone surveys during the COVID-19 pandemic include disability questions? An assessment of COVID-19 surveys from March 2020 to December 2022;BMJ Open;2024-07

2. Online Health Search Via Multidimensional Information Quality Assessment Based on Deep Language Models: Algorithm Development and Validation;JMIR AI;2024-05-02

3. A large-scale dataset of patient summaries for retrieval-based clinical decision support systems;Scientific Data;2023-12-18

4. Deep Learning-Based Good Cause Marketing and the Impact of the Internet on MICE Events in the Context of The Epidemic;Fluctuation and Noise Letters;2023-12-14

5. CONORM: Context-Aware Entity Normalization for Adverse Drug Event Detection;2023-09-26