Trialstreamer: A living, automatically updated database of clinical trial reports

Authors:

Marshall Iain J (1), Nye Benjamin (2), Kuiper Joël (3), Noel-Storr Anna (4), Marshall Rachel (5), Maclean Rory (1), Soboczenski Frank (1), Nenkova Ani (6), Thomas James (7), Wallace Byron C (2)

Affiliations:

1. School of Population Health and Environmental Sciences, King’s College London, London, United Kingdom

2. Khoury College of Computer Sciences, Northeastern University, Boston, Massachusetts, USA

3. Vortext Systems, Groningen, the Netherlands

4. Cochrane Dementia Group, University of Oxford, Oxford, United Kingdom

5. Cochrane Editorial and Methods Department, London, United Kingdom

6. Computer and Information Science, University of Pennsylvania, Philadelphia, Pennsylvania, USA

7. EPPI-Centre, UCL Social Research Institute, University College London, London, United Kingdom

Abstract

Objective: Randomized controlled trials (RCTs) are the gold standard method for evaluating whether a treatment works in health care but can be difficult to find and make use of. We describe the development and evaluation of a system to automatically find and categorize all new RCT reports.

Materials and Methods: Trialstreamer continuously monitors PubMed and the World Health Organization International Clinical Trials Registry Platform, looking for new RCTs in humans using a validated classifier. We combine machine learning and rule-based methods to extract information from the RCT abstracts, including free-text descriptions of trial PICO (populations, interventions/comparators, and outcomes) elements and map these snippets to normalized MeSH (Medical Subject Headings) vocabulary terms. We additionally identify sample sizes, predict the risk of bias, and extract text conveying key findings. We store all extracted data in a database, which we make freely available for download, and via a search portal, which allows users to enter structured clinical queries. Results are ranked automatically to prioritize larger and higher-quality studies.

Results: As of early June 2020, we have indexed 673 191 publications of RCTs, of which 22 363 were published in the first 5 months of 2020 (142 per day). We additionally include 304 111 trial registrations from the International Clinical Trials Registry Platform. The median trial sample size was 66.

Conclusions: We present an automated system for finding and categorizing RCTs. This yields a novel resource: a database of structured information automatically extracted for all published RCTs in humans. We make daily updates of this database available on our website (https://trialstreamer.robotreviewer.net).

Funder

UK Medical Research Council

National Institutes of Health under the National Library of Medicine

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Cited by 40 articles.
