Advancing data science in drug development through an innovative computational framework for data sharing and statistical analysis

Author:

Mallon Ann-MarieORCID,Häring Dieter A.,Dahlke Frank,Aarden Piet,Afyouni Soroosh,Delbarre Daniel,El Emam Khaled,Ganjgahi Habib,Gardiner Stephen,Kwok Chun Hei,West Dominique M.,Straiton Ewan,Haemmerle Sibylle,Huffman Adam,Hofmann Tom,Kelly Luke J.,Krusche Peter,Laramee Marie-Claude,Lheritier Karine,Ligozio Greg,Readie Aimee,Santos Luis,Nichols Thomas E.,Branson Janice,Holmes Chris

Abstract

Abstract Background Novartis and the University of Oxford’s Big Data Institute (BDI) have established a research alliance with the aim to improve health care and drug development by making it more efficient and targeted. Using a combination of the latest statistical machine learning technology with an innovative IT platform developed to manage large volumes of anonymised data from numerous data sources and types we plan to identify novel patterns with clinical relevance which cannot be detected by humans alone to identify phenotypes and early predictors of patient disease activity and progression. Method The collaboration focuses on highly complex autoimmune diseases and develops a computational framework to assemble a research-ready dataset across numerous modalities. For the Multiple Sclerosis (MS) project, the collaboration has anonymised and integrated phase II to phase IV clinical and imaging trial data from ≈35,000 patients across all clinical phenotypes and collected in more than 2200 centres worldwide. For the “IL-17” project, the collaboration has anonymised and integrated clinical and imaging data from over 30 phase II and III Cosentyx clinical trials including more than 15,000 patients, suffering from four autoimmune disorders (Psoriasis, Axial Spondyloarthritis, Psoriatic arthritis (PsA) and Rheumatoid arthritis (RA)). Results A fundamental component of successful data analysis and the collaborative development of novel machine learning methods on these rich data sets has been the construction of a research informatics framework that can capture the data at regular intervals where images could be anonymised and integrated with the de-identified clinical data, quality controlled and compiled into a research-ready relational database which would then be available to multi-disciplinary analysts. The collaborative development from a group of software developers, data wranglers, statisticians, clinicians, and domain scientists across both organisations has been key. This framework is innovative, as it facilitates collaborative data management and makes a complicated clinical trial data set from a pharmaceutical company available to academic researchers who become associated with the project. Conclusions An informatics framework has been developed to capture clinical trial data into a pipeline of anonymisation, quality control, data exploration, and subsequent integration into a database. Establishing this framework has been integral to the development of analytical tools.

Publisher

Springer Science and Business Media LLC

Subject

Health Informatics,Epidemiology

Reference19 articles.

1. Lublin, et al. Defining the clinical course of multiple sclerosis The 2013 Revisions. Neurology. 2014;83(3):278–86.

2. El Emam K. Guide to the De-Identification of Personal Health Information. CRC Press (Auerbach), 2013.

3. Article 29 Data Protection Working Party. Opinion 05/2014 on Anonymization Techniques. (2014).

4. European Medicines Agency. European Medicines Agency policy on publication of data for medicinal products for human use: Policy 0070.”= Oct. 02, 2014, [Online]. Available: http://www.ema.europa.eu/docs/en_GB/document_library/Other/2014/10/WC500174796.pdf.

5. Health Canada. Guidance document on public release of clinical information, Apr. 01, 2019. https://www.canada.ca/en/health-canada/services/drug-health-product-review-approval/profile-public-release-clinical-information-guidance.html.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3