Optimization of Imputation Strategies for High-Resolution Gas Chromatography–Mass Spectrometry (HR GC–MS) Metabolomics Data-Reference-Cited by-同舟云学术

Optimization of Imputation Strategies for High-Resolution Gas Chromatography–Mass Spectrometry (HR GC–MS) Metabolomics Data

Published:2022-05-11 Issue:5 Volume:12 Page:429
ISSN:2218-1989
Container-title:Metabolites
language:en
Short-container-title:Metabolites

Author:

Ampong Isaac^ORCID,Zimmerman Kip D.^ORCID,Nathanielsz Peter W.,Cox Laura A.^ORCID,Olivier Michael^ORCID

Abstract

Gas chromatography–coupled mass spectrometry (GC–MS) has been used in biomedical research to analyze volatile, non-polar, and polar metabolites in a wide array of sample types. Despite advances in technology, missing values are still common in metabolomics datasets and must be properly handled. We evaluated the performance of ten commonly used missing value imputation methods with metabolites analyzed on an HR GC–MS instrument. By introducing missing values into the complete (i.e., data without any missing values) National Institute of Standards and Technology (NIST) plasma dataset, we demonstrate that random forest (RF), glmnet ridge regression (GRR), and Bayesian principal component analysis (BPCA) shared the lowest root mean squared error (RMSE) in technical replicate data. Further examination of these three methods in data from baboon plasma and liver samples demonstrated they all maintained high accuracy. Overall, our analysis suggests that any of the three imputation methods can be applied effectively to untargeted metabolomics datasets with high accuracy. However, it is important to note that imputation will alter the correlation structure of the dataset and bias downstream regression coefficients and p-values.

Funder

National Institutes of Health

Publisher

MDPI AG

Subject

Molecular Biology,Biochemistry,Endocrinology, Diabetes and Metabolism

Link

https://www.mdpi.com/2218-1989/12/5/429/pdf

Reference34 articles.

1. A Workflow for Missing Values Imputation of Untargeted Metabolomics Data

2. Analytical techniques for metabolomic studies: a review

3. Emerging Applications of Metabolomics in Clinical Pharmacology

4. Power of metabolomics in biomarker discovery and mining mechanisms of obesity

5. Integrating clinical metabolomics-based biomarker discovery and clinical pharmacology to enable precision medicine

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Integrated multi-omics analysis of brain aging in female nonhuman primates reveals altered signaling pathways relevant to age-related disorders;Neurobiology of Aging;2023-12

2. Maternal obesity alters offspring liver and skeletal muscle metabolism in early post‐puberty despite maintaining a normal post‐weaning dietary lifestyle;The FASEB Journal;2022-11-23

3. Improved GSimp: A Flexible Missing Value Imputation Method to Support Regulatory Bioequivalence Assessment;Annals of Biomedical Engineering;2022-09-15

4. Imputation of Missing Values for Multi-Biospecimen Metabolomics Studies: Bias and Effects on Statistical Validity;Metabolites;2022-07-21