The successes and challenges of harmonising juvenile idiopathic arthritis (JIA) datasets to create a large-scale JIA data resource
-
Published:2023-07-13
Issue:1
Volume:21
Page:
-
ISSN:1546-0096
-
Container-title:Pediatric Rheumatology
-
language:en
-
Short-container-title:Pediatr Rheumatol
Author:
Lawson-Tovey SaskiaORCID, Smith Samantha LouiseORCID, Geifman NopharORCID, Shoop-Worrall StephanieORCID, Ng SandraORCID, Barnes Michael R.ORCID, Wedderburn Lucy R.ORCID, Hyrich Kimme L.ORCID, Kartawinata Melissa, Wanstall Zoe, Jebson Bethany R., McNeece Alyssia, Ralph Elizabeth, Alexiou Vasiliki, Dekaj Fatjon, Kimonyo Aline, Merali Fatema, Sumner Emma, Robinson Emily, Feilding Freya L., Dick Andrew, Beresford Michael W., Carlsson Emil, Fairlie Joanna, Gritzfeld Jenna F., Ramanan Athimalaipet, Duerr Teresa, Eyre Stephen, Raychaudhuri Soumya, Morris Andrew, Yarwood Annie, Smith Samantha, Bowes John, Martin Paul, Tordoff Melissa, Stadler Michael, Thomson Wendy, Tarasek Damian, Wallace Chris, Lin Wei-Yu, Clarke Sarah, Kent Toby, Sornasse Thierry, Dastros-Pitei Daniela, Mukherjee Sumanta, Roberts Jacqui, Kallala Rami, Neale Helen, Ioannou John, Al-Mossawi Hussein,
Abstract
Abstract
Background
CLUSTER is a UK consortium focussed on precision medicine research in JIA/JIA-Uveitis. As part of this programme, a large-scale JIA data resource was created by harmonizing and pooling existing real-world studies. Here we present challenges and progress towards creation of this unique large JIA dataset.
Methods
Four real-world studies contributed data; two clinical datasets of JIA patients starting first-line methotrexate (MTX) or tumour necrosis factor inhibitors (TNFi) were created. Variables were selected based on a previously developed core dataset, and encrypted NHS numbers were used to identify children contributing similar data across multiple studies.
Results
Of 7013 records (from 5435 individuals), 2882 (1304 individuals) represented the same child across studies. The final datasets contain 2899 (MTX) and 2401 (TNFi) unique patients; 1018 are in both datasets. Missingness ranged from 10 to 60% and was not improved through harmonisation.
Conclusions
Combining data across studies has achieved dataset sizes rarely seen in JIA, invaluable to progressing research. Losing variable specificity and missingness, and their impact on future analyses requires further consideration.
Funder
Medical Research Council Versus Arthritis Great Ormond Street Hospital Charity AbbVie Swedish Orphan Biovitrum Olivia's Vision
Publisher
Springer Science and Business Media LLC
Subject
Immunology and Allergy,Rheumatology,Pediatrics, Perinatology and Child Health
Reference19 articles.
1. Minden K. Juvenile Idiopathic Arthritis in Adolescence and Young Adulthood. In: Adolescent and Young Adult Rheumatology in Clinical Practice. Springer, Cham; 2019. p. 85–105. Available from: https://link.springer.com/chapter/https://doi.org/10.1007/978-3-319-95519-3_7 2. Doiron D, Burton P, Marcon Y, Gaye A, Wolffenbuttel BHR, Perola M, et al. Data harmonization and federated analysis of population-based studies: The BioSHaRE project. Emerg Themes Epidemiol. 2013 Nov 21 [cited 2021 Mar 12];10:12. Available from: https://ete-online.biomedcentral.com/articles/https://doi.org/10.1186/1742-7622-10-12 3. CLUSTER Consortium. [cited 2021 Mar 4]. Available from: https://www.clusterconsortium.org.uk/ 4. Doan A, Halevy A, Ives Z.
1 - Introduction. In: Doan A, Halevy A, Ives Z, editors. Principles of Data Integration. Morgan Kaufmann;
2012. p. 1–18. https://doi.org/10.1016/B978-0-12-416044-6.00001-6. https://www.sciencedirect.com/science/article/pii/B9780124160446000016. 5. United Nations Statistical Office. Volume 1: Legal, organizational and technical aspects. In: Handbook of Vital Statistics Systems and Methods. United Nations. 1991. ISBN: 9211613280.
|
|