Improving the quality, speed and transparency of curating data to the Observational Medical Outcomes Partnership (OMOP) Common Data Model using the Carrot tool (Preprint)

Author:

Cox Samuel, Masood Erum, Panagi Vasiliki, Macdonald Calum, Milligan Gordon, Horban Scott, Santos Roberto, Hall Chris, Lea Daniel, Tarr Simon, Mumtaz Shahzad, Akashili Emeka, Rae Andy, Cole Christian, Sheikh Aziz, Jefferson Emily, Quinlan Philip Roy

Abstract

UNSTRUCTURED

Adoption of data standards is low across the healthcare system, so undertaking international research usually requires converting data to a common data model. One such model is the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), which has gained significant traction among researchers and developers of data platforms. The Observational Health Data Sciences and Informatics (OHDSI) partnership manages OMOP and provides many open-source tools to help data holders convert their data to the OMOP CDM. The challenge, however, lies in the skills, knowledge and capacity that teams need to convert their data to OMOP. The European Health Data & Evidence Network (EHDEN) provided funds that allowed data owners to bring in external resource to perform the required conversions, resulting in a one-off conversion of the data. The Carrot software is a new set of open-source tools designed to help address these challenges without requiring external parties to access the data. Data protection requirements are increasing, and privacy by design is a core principle of European and UK data protection legislation. Our aims for Carrot were to provide a standardised mechanism for managing the data curation process, to capture the rules used to convert the data, and to create a platform that can re-use rules across projects, driving standardisation of process and improving speed without compromising quality. Most importantly, in keeping with privacy by design, this was to be delivered without requiring those creating the rules to have access to any of the data. Carrot has been delivered and used in the CO-CONNECT project to help make datasets discoverable via a federated platform. It has been used to create over 45,000 rules, and over 5 million patient records have been converted.
This was achieved while upholding our principle that the team creating the rules has no access to the underlying data. Carrot has also facilitated the re-use of existing rules, with the majority of rules being re-used rather than manually curated. Carrot has demonstrated that it can be used alongside existing OHDSI tools, with a focus on the mapping stage. In the CO-CONNECT project, rules were successfully re-used across datasets. The approach is valid and delivered the expected benefits; future work will continue to optimise the generation of rules.

Publisher

JMIR Publications Inc.
