Systematic review of health research using internet search data

Author:

Thompson Matthew1,Chan Calvin2,Daniels Elisabeth2,Obana Kevin1,Taylor James3,Grailey Kate2,Schneider Renee1,Flatley John1,Sounderajah Viknesh1,Darzi Ara2ORCID

Affiliation:

1. Google, Inc.

2. Imperial College London

3. Google Health

Abstract

Abstract

Novel types of digital data, including internet search data, have potential to improve understanding of early predictors of serious health conditions and enable timely management. While many studies have used aggregate anonymized search trends in this way, what is less clear is the predictive or diagnostic value of online searches at the individual level. While an increasing number of studies have used these kinds of data, this research method is still emerging. We therefore undertook a systematic review of published research that has assessed the predictive or diagnostic value of individual internet search data. MEDLINE and Embase were searched through March 2024 for studies utilising individual internet search data to predict or diagnose patient disease status. Due to the heterogeneous nature of the design, methodology and reported outcomes of included studies, a narrative synthesis of studies and pre-specified outcomes was performed. Study quality was assessed with the Newcastle-Ottawa Scale and PROBAST tool. Twenty-three studies met the inclusion criteria. Conditions of interest encompassed mental health, neurological conditions, malignancies, and miscellaneous healthcare presentations. Data on individuals’ search history were obtained from search engines using anonymous search queries (Bing, Yahoo!) or from consented participants (Google) where consent rates ranged from 20–70%. Wide variability in AUROC (range: <0.53 to > 0.99), sensitivity (range: 0.44 to 0.81) and F1 score (0.36 to 0.80) were reported. Studies noted a range of predictive linguistic, temporal, and other features (e.g., spelling error frequency). This review demonstrated that the use of individual internet search data holds diagnostic and predictive potential, with evidence of strong associative features. However, there was significant variability regarding conditions of interest, methodology, and predictive models used. Given the common use of internet searches by patients as part of their healthcare journeys, individual search data holds significant potential, and justifies further research, including the use of established diagnoses.

Publisher

Research Square Platform LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3