Affiliation:
1. VIB - UGent Center for medical biotechnology
2. VIB
3. European Bioinformatics Institute
4. European Bioinformatics Institute (EMBL-EBI)
5. VIB-UGent
Abstract
Abstract
Sharing data and resources has revolutionized life sciences, particularly in proteomics, where public data has enabled researchers to reanalyze and reinterpret data in novel ways. However, the lack of comprehensive metadata remains a significant challenge to unlocking the full potential of publicly shared data. In response, the Sample and Data Relationship Format (SDRF) Proteomics was developed, However, its complexity presents several challenges. This study investigated metadata annotations in proteomics data sets from the PRIDE database and the corresponding publications, and identified major gaps in metadata provision. To bridge this gap, we created a user-friendly, ontology-based Streamlit application, named lesSDRF, that guides users through the annotation process using SDRF. lesSDRF aims to encourage researchers to provide more detailed metadata annotations, leading to greater insights and scientific advances in proteomics. By addressing this issue, we can facilitate more collaborative efforts and enhance our understanding of biological processes. LesSDRF is available via https://compomics-lessdrf-home-2rdf84.streamlit.app/.
Publisher
Research Square Platform LLC
Reference42 articles.
1. The Protein Data Bank. A computer-based archival file for macromolecular structures;Bernstein FC;Eur. J. Biochem.,1977
2. Initial sequencing and analysis of the human genome;Lander ES;Nature,2001
3. Highly accurate protein structure prediction for the human proteome;Tunyasuvunakool K;Nature,2021
4. DeepMind AI cracks 50-year-old problem of protein folding | DeepMind | The Guardian. https://www.theguardian.com/technology/2020/nov/30/deepmind-ai-cracks-50-year-old-problem-of-biology-research.
5. The FAIR Guiding Principles for scientific data management and stewardship;Wilkinson MD;Sci. Data,2016