Abstract
Background
Secondary analysis of data from completed randomized controlled trials (RCTs) is a critical and efficient way to maximize the potential benefit from past research. De-identified primary data from completed RCTs have become increasingly available in recent years; however, the lack of standardized data products is a major barrier to further use of these valuable data. Pre-statistical harmonization of data structures, variables, and codebooks across RCTs would facilitate secondary data analysis, including meta-analysis and comparative effectiveness studies. We describe an initiative to harmonize de-identified primary data from substance use disorder (SUD) treatment RCTs funded by the National Institute on Drug Abuse (NIDA) and available on the NIDA Data Share website.
Methods
Harmonized datasets with standardized data structures, variable names, labels, and definitions, together with harmonized codebooks, were developed for 36 completed RCTs. Common data domains were identified to bundle data files from individual RCTs according to relevant subject areas. Variables within an instrument were harmonized if at least two RCTs used that instrument. The structures of the harmonized data were determined based on feedback from clinical trialists and SUD research experts.
Results
We have created a harmonized database of variables across 36 RCTs, with a built-in label and a brief definition for each variable. Data files from the RCTs have been consistently categorized into eight domains (enrollment, demographics, adherence, adverse events, physical health measures, mental-behavioral-cognitive health measures, self-reported substance use measures, and biologic substance use measures). Harmonized codebooks and instrument/variable concordance tables have also been developed to help identify instruments and variables of interest more easily.
Conclusions
The harmonized data from RCTs of SUD treatments can potentially promote future secondary data analysis of completed RCTs by allowing data from multiple RCTs to be combined, and can provide guidance for future RCTs in SUD treatment research.
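The selection rule stated in the Methods (an instrument is harmonized only if at least two RCTs used it) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the trial IDs, instrument abbreviations, and the function name `instruments_to_harmonize` are hypothetical placeholders chosen for illustration, and the real inventory would come from the individual trial codebooks.

```python
from collections import defaultdict

# Hypothetical inventory: trial ID -> instruments collected in that trial.
# These IDs and instrument names are placeholders, not NIDA Data Share content.
trial_instruments = {
    "trial_A": {"ASI", "TLFB", "UDS"},
    "trial_B": {"ASI", "UDS"},
    "trial_C": {"TLFB", "SF36"},
}


def instruments_to_harmonize(inventory, min_trials=2):
    """Map each instrument used in at least `min_trials` trials to those trials."""
    usage = defaultdict(set)
    for trial, instruments in inventory.items():
        for instrument in instruments:
            usage[instrument].add(trial)
    # Keep only instruments shared by enough trials; sort for stable output.
    return {
        instrument: sorted(trials)
        for instrument, trials in sorted(usage.items())
        if len(trials) >= min_trials
    }


# A simple instrument concordance: which trials share which instrument.
concordance = instruments_to_harmonize(trial_instruments)
for instrument, trials in concordance.items():
    print(instrument, trials)
```

In this toy inventory, the instrument that appears in only one trial ("SF36") is left out of the concordance, while the three shared instruments are retained along with the trials that used them.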
Publisher
Cold Spring Harbor Laboratory