Probabilistic principal component analysis for metabolomic data-Reference-Cited by-同舟云学术

Probabilistic principal component analysis for metabolomic data

Published:2010-11-23 Issue:1 Volume:11 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Nyamundanda Gift,Brennan Lorraine,Gormley Isobel Claire

Abstract

Abstract Background Data from metabolomic studies are typically complex and high-dimensional. Principal component analysis (PCA) is currently the most widely used statistical technique for analyzing metabolomic data. However, PCA is limited by the fact that it is not based on a statistical model. Results Here, probabilistic principal component analysis (PPCA) which addresses some of the limitations of PCA, is reviewed and extended. A novel extension of PPCA, called probabilistic principal component and covariates analysis (PPCCA), is introduced which provides a flexible approach to jointly model metabolomic data and additional covariate information. The use of a mixture of PPCA models for discovering the number of inherent groups in metabolomic data is demonstrated. The jackknife technique is employed to construct confidence intervals for estimated model parameters throughout. The optimal number of principal components is determined through the use of the Bayesian Information Criterion model selection tool, which is modified to address the high dimensionality of the data. Conclusions The methods presented are illustrated through an application to metabolomic data sets. Jointly modeling metabolomic data and covariates was successfully achieved and has the potential to provide deeper insight to the underlying data structure. Examination of confidence intervals for the model parameters, such as loadings, allows for principled and clear interpretation of the underlying data structure. A software package called MetabolAnalyze, freely available through the R statistical software, has been developed to facilitate implementation of the presented methods in the metabolomics field.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-11-571.pdf

Reference34 articles.

1. Brennan L: Session 2: Personalised nutrition. Metabolomic applications in nutritional research. Proceedings of the Nutrition Society 2008, 67(4):404–408. 10.1017/S0029665108008719

2. Keun HC: Metabonomic modeling of drug toxicity. Pharmacology and Therapeutics 2006, 109(12):92–106. 10.1016/j.pharmthera.2005.06.008

3. Gibney MJ, Walsh M, Brennan L, Roche HM, German B, van Ommen B: Metabolomics in human nutrition: opportunities and challenges. American Journal of Clinical Nutrition 2005, 82(3):497–503.

4. Reo NV: Metabonomics based on NMR spectroscopy. Drug and Chemical Toxicology 2002, 25(4):375–382. 10.1081/DCT-120014789

5. Dettmer K, Aronov PA, Hammock BD: Mass spectrometry-based metabolomics. Mass Spectrometry Reviews 2007, 26: 51–78. 10.1002/mas.20108

Cited by 127 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Benchmarking feature selection and feature extraction methods to improve the performances of machine-learning algorithms for patient classification using metabolomics biomedical data;Computational and Structural Biotechnology Journal;2024-12

2. Machine learning for the advancement of genome-scale metabolic modeling;Biotechnology Advances;2024-09

3. ML-based clinical decision support models based on metabolomics data;TrAC Trends in Analytical Chemistry;2024-09

4. Impact of thermal, high‐pressure and ultra‐shear pasteurisation technologies on beetroot juice metabolites using untargeted nuclear magnetic resonance spectroscopy;International Journal of Food Science & Technology;2024-07-08

5. Campylobacterinfection of young children in Colombia and its impact on the gastrointestinal environment;2024-05-07