Affiliation:
1. West Coast Metabolomics Center, UC Davis, 451 Health Sciences Drive, Davis, CA 95616, USA
Abstract
Large-scale metabolomics assays are widely used in epidemiology for biomarker discovery and risk assessments. However, systematic errors introduced by instrumental signal drifting pose a big challenge in large-scale assays, especially for derivatization-based gas chromatography–mass spectrometry (GC–MS). Here, we compare the results of different normalization methods for a study with more than 4000 human plasma samples involved in a type 2 diabetes cohort study, in addition to 413 pooled quality control (QC) samples, 413 commercial pooled plasma samples, and a set of 25 stable isotope-labeled internal standards used for every sample. Data acquisition was conducted across 1.2 years, including seven column changes. In total, 413 pooled QC (training) and 413 BioIVT samples (validation) were used for normalization comparisons. Surprisingly, neither internal standards nor sum-based normalizations yielded median precision of less than 30% across all 563 metabolite annotations. While the machine-learning-based SERRF algorithm gave 19% median precision based on the pooled quality control samples, external cross-validation with BioIVT plasma pools yielded a median 34% relative standard deviation (RSD). We developed a new method: systematic error reduction by denoising autoencoder (SERDA). SERDA lowered the median standard deviations of the training QC samples down to 16% RSD, yielding an overall error of 19% RSD when applied to the independent BioIVT validation QC samples. This is the largest study on GC–MS metabolomics ever reported, demonstrating that technical errors can be normalized and handled effectively for this assay. SERDA was further validated on two additional large-scale GC–MS-based human plasma metabolomics studies, confirming the superior performance of SERDA over SERRF or sum normalizations.
Subject
Molecular Biology,Biochemistry,Endocrinology, Diabetes and Metabolism
Reference40 articles.
1. Sakaguchi, C.A., Nieman, D.C., Signini, E.F., Abreu, R.M., and Catai, A.M. (2019). Metabolomics-Based Studies Assessing Exercise-Induced Alterations of the Human Metabolome: A Systematic Review. Metabolites, 9.
2. HMDB: The Human Metabolome Database;Wishart;Nucleic Acids Res.,2007
3. Integration of GC–MS and LC–MS for Untargeted Metabolomics Profiling;Zeki;J. Pharm. Biomed. Anal.,2020
4. Normalizing and Integrating Metabolomics Data;Dias;Anal. Chem.,2012
5. Metabolite Fingerprinting: Detecting Biological Features by Independent Component Analysis;Scholz;Bioinformatics,2004
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献