A semantic proteomics dashboard (SemPoD) for data management in translational research-Reference-Cited by-同舟云学术

A semantic proteomics dashboard (SemPoD) for data management in translational research

Published:2012-12 Issue:S3 Volume:6 Page:
ISSN:1752-0509
Container-title:BMC Systems Biology
language:en
Short-container-title:BMC Syst Biol

Author:

Jayapandian Catherine P,Zhao Meng,Ewing Rob M,Zhang Guo-Qiang,Sahoo Satya S

Abstract

Abstract Background One of the primary challenges in translational research data management is breaking down the barriers between the multiple data silos and the integration of 'omics data with clinical information to complete the cycle from the bench to the bedside. The role of contextual metadata, also called provenance information, is a key factor ineffective data integration, reproducibility of results, correct attribution of original source, and answering research queries involving "W hat", "W here", "W hen", "W hich", "W ho", "How", and "W hy" (also known as the W7 model). But, at present there is limited or no effective approach to managing and leveraging provenance information for integrating data across studies or projects. Hence, there is an urgent need for a paradigm shift in creating a "provenance-aware" informatics platform to address this challenge. We introduce an ontology-driven, intuitive Sem antic P ro teomics D ashboard (SemPoD) that uses provenance together with domain information (semantic provenance) to enable researchers to query, compare, and correlate different types of data across multiple projects, and allow integration with legacy data to support their ongoing research. Results The SemPoD platform, currently in use at the Case Center for Proteomics and Bioinformatics (CPB), consists of three components: (a) Ontology-driven Visual Query Composer, (b) Result Explorer, and (c) Query Manager. Currently, SemPoD allows provenance-aware querying of 1153 mass-spectrometry experiments from 20 different projects. SemPod uses the systems molecular biology provenance ontology (SysPro) to support a dynamic query composition interface, which automatically updates the components of the query interface based on previous user selections and efficientlyprunes the result set usinga "smart filtering" approach. The SysPro ontology re-uses terms from the PROV-ontology (PROV-O) being developed by the World Wide Web Consortium (W3C) provenance working group, the minimum information required for reporting a molecular interaction experiment (MIMIx), and the minimum information about a proteomics experiment (MIAPE) guidelines. The SemPoD was evaluated both in terms of user feedback and as scalability of the system. Conclusions SemPoD is an intuitive and powerful provenance ontology-driven data access and query platform that uses the MIAPE and MIMIx metadata guideline to create an integrated view over large-scale systems molecular biology datasets. SemPoD leverages the SysPro ontology to create an intuitive dashboard for biologists to compose queries, explore the results, and use a query manager for storing queries for later use. SemPoD can be deployed over many existing database applications storing 'omics data, including, as illustrated here, the LabKey data-management system. The initial user feedback evaluating the usability and functionality of SemPoD has been very positive and it is being considered for wider deployment beyond the proteomics domain, and in other 'omics' centers.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Modelling and Simulation,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1752-0509-6-S3-S20.pdf

Reference19 articles.

1. Editorial-Introduction: Challenges and Opportunities. Science. 2011, 331 (6018): 692-692.

2. Editorial: Integrating with integrity. Nat Genet. 2010, 42 (1): 1-

3. Goble C: Position Statement: Musings on Provenance, Workflow and (Semantic Web) Annotations for Bioinformatics. Workshop on Data Derivation and Provenance: 2002; Chicago. 2002

4. Sahoo SS, Nguyen V, Bodenreider O, Parikh P, Minning T, Sheth AP: A unified framework for managing provenance information in translational research. BMC Bioinformatics. 2011, 12: 461-10.1186/1471-2105-12-461.

5. Lee T, Bressan S: Multimodal Integration of Disparate Information Sources with Attribution. Entity Relationship Workshop on Information Retrieval and Conceptual Modeling. 1997

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Provenance Information for Biomedical Data and Workflows: Scoping Review;Journal of Medical Internet Research;2024-08-23

2. Developing and validating interoperable ontology-driven game-based assessments;Expert Systems with Applications;2024-08

3. Provenance Information for Biomedical Data and Workflows: Scoping Review (Preprint);2023-07-27

4. Capturing provenance information for biomedical data and workflows: A scoping review;2023-02-09

5. Approaches and Criteria for Provenance in Biomedical Data Sets and Workflows: Protocol for a Scoping Review;JMIR Research Protocols;2021-11-22