A unified data infrastructure to support large-scale rare disease research

Published: 2023-12-20

Author:

Johansson Lennart F., Laurie Steve, Spalding Dylan, Gibson Spencer, Ruvolo David, Thomas Coline, Piscia Davide, de Andrade Fernanda, Been Gerieke, Bijlsma Marieke, Brunner Han, Cimerman Sandi, Yavari Dizjikan Farid, Ellwanger Kornelia, Fernandez Marcos, Freeberg Mallory, van de Geijn Gert-Jan, Kanninga Roan, Maddi Vatsalya, Mehtarizadeh Mehdi, Neerincx Pieter, Ossowski Stephan, Rath Ana, Roelofs-Prins Dieuwke, Stok-Benjamins Marloes, van der Velde K. Joeri, Veal Colin, van der Vries Gerben, Wadsley Marc, Warren Gregory, Zurek Birte, Keane Thomas, Graessner Holm, Beltran Sergi, Swertz Morris A., Brookes Anthony J.

Abstract

The Solve-RD project brings together clinicians, scientists, and patient representatives from 51 institutes spanning 15 countries to collaborate on genetically diagnosing (“solving”) rare diseases (RDs). The project aims to significantly increase the diagnostic success rate by co-analysing data from thousands of RD cases, including phenotypes, pedigrees, exome/genome sequencing and multi-omics data. Here we report on the data infrastructure devised and created to support this co-analysis. This infrastructure enables users to store, find, connect, and analyse data and metadata in a collaborative manner. Pseudonymised phenotypic and raw experimental data are submitted to the RD-Connect Genome-Phenome Analysis Platform and processed through standardised pipelines. The resulting files and newly produced omics data are sent to the European Genome-phenome Archive, which adds unique file identifiers and provides long-term storage and controlled access services. MOLGENIS “RD3” and Café Variome “Discovery Nexus” connect data and metadata and offer discovery services, and secure cloud-based “Sandboxes” support multi-party data analysis. This proven infrastructure design provides a blueprint for other projects that need to analyse large amounts of heterogeneous data.

Publisher

Cold Spring Harbor Laboratory

References (37 articles)

1. Solve-RD: systematic pan-European data sharing and collaborative analysis to solve rare diseases; European Journal of Human Genetics, 2021

2. The RD-Connect Genome-Phenome Analysis Platform: Accelerating diagnosis, research, and gene discovery for rare diseases

3. M.A. Swertz et al. The MOLGENIS toolkit: rapid prototyping of biosoftware at the push of a button; BMC Bioinformatics 11 (Suppl 12), 2010

4. MOLGENIS research: advanced bioinformatics data software for non-bioinformaticians

5. Cafe Variome: General-Purpose Software for Making Genotype–Phenotype Data Discoverable in Restricted or Open Access Contexts; Human Mutation, 2015