lesSDRF Is More: Maximizing The Value Of Proteomics Data Through Streamlined Metadata Annotation-Reference-Cited by-同舟云学术

lesSDRF Is More: Maximizing The Value Of Proteomics Data Through Streamlined Metadata Annotation

Published:2023-05-23 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Claeys Tine¹,Bossche Tim Van Den²^ORCID,Perez-Riverol Yasset³^ORCID,Gevaert Kris,Vizcaino Juan Antonio⁴^ORCID,Martens Lennart⁵^ORCID

Affiliation:

1. VIB - UGent Center for medical biotechnology

2. VIB

3. European Bioinformatics Institute

4. European Bioinformatics Institute (EMBL-EBI)

5. VIB-UGent

Abstract

Abstract Sharing data and resources has revolutionized life sciences, particularly in proteomics, where public data has enabled researchers to reanalyze and reinterpret data in novel ways. However, the lack of comprehensive metadata remains a significant challenge to unlocking the full potential of publicly shared data. In response, the Sample and Data Relationship Format (SDRF) Proteomics was developed, However, its complexity presents several challenges. This study investigated metadata annotations in proteomics data sets from the PRIDE database and the corresponding publications, and identified major gaps in metadata provision. To bridge this gap, we created a user-friendly, ontology-based Streamlit application, named lesSDRF, that guides users through the annotation process using SDRF. lesSDRF aims to encourage researchers to provide more detailed metadata annotations, leading to greater insights and scientific advances in proteomics. By addressing this issue, we can facilitate more collaborative efforts and enhance our understanding of biological processes. LesSDRF is available via https://compomics-lessdrf-home-2rdf84.streamlit.app/.

Publisher

Research Square Platform LLC

Reference42 articles.

1. The Protein Data Bank. A computer-based archival file for macromolecular structures;Bernstein FC;Eur. J. Biochem.,1977

2. Initial sequencing and analysis of the human genome;Lander ES;Nature,2001

3. Highly accurate protein structure prediction for the human proteome;Tunyasuvunakool K;Nature,2021

4. DeepMind AI cracks 50-year-old problem of protein folding | DeepMind | The Guardian. https://www.theguardian.com/technology/2020/nov/30/deepmind-ai-cracks-50-year-old-problem-of-biology-research.

5. The FAIR Guiding Principles for scientific data management and stewardship;Wilkinson MD;Sci. Data,2016