A new statistical workflow (R-packages based) to investigate associations between one variable of interest and the metabolome
-
Published:2023-11-30
Issue:1
Volume:20
Page:
-
ISSN:1573-3890
-
Container-title:Metabolomics
-
language:en
-
Short-container-title:Metabolomics
Author:
Ferrario Paola G.,Bub Achim,Frommherz Lara,Krüger Ralf,Rist Manuela J.,Watzl Bernhard
Abstract
Abstract
Introduction
In metabolomics, the investigation of associations between the metabolome and one trait of interest is a key research question. However, statistical analyses of such associations are often challenging. Statistical tools enabling resilient verification and clear presentation are therefore highly desired.
Objectives
Our aim is to provide a contribution for statistical analysis of metabolomics data, offering a widely applicable open-source statistical workflow, which considers the intrinsic complexity of metabolomics data.
Methods
We combined selected R packages tailored for all properties of heterogeneous metabolomics datasets, where metabolite parameters typically (i) are analyzed in different matrices, (ii) are measured on different analytical platforms with different precision, (iii) are analyzed by targeted as well as non-targeted methods, (iv) are scaled variously, (v) reveal heterogeneous variances, (vi) may be correlated, (vii) may have only few values or values below a detection limit, or (viii) may be incomplete.
Results
The code is shared entirely and freely available. The workflow output is a table of metabolites associated with a trait of interest and a compact plot for high-quality results visualization. The workflow output and its utility are presented by applying it to two previously published datasets: one dataset from our own lab and another dataset taken from the repository MetaboLights.
Conclusion
Robustness and benefits of the statistical workflow were clearly demonstrated, and everyone can directly re-use it for analysis of own data.
Funder
Max Rubner-Institut, Bundesforschungsinstitut für Ernährung und Lebensmittel
Publisher
Springer Science and Business Media LLC
Subject
Clinical Biochemistry,Biochemistry,Endocrinology, Diabetes and Metabolism
Reference21 articles.
1. Antonelli, J., Claggett, BL., Henglin, M., Kim, A., Ovsak, G., Kim, N. (2019) Statistical workflow for feature selection in human metabolomics data. Metabolites., 9(7), 143. 2. Bub, A., Kriebel, A., Dorr, C., Bandt, S., Rist, M., & Roth, A. (2016). The Karlsruhe Metabolomics and Nutrition (KarMeN) Study: Protocol and methods of a cross-sectional study to characterize the metabolome of healthy men and women. JMIR Research Protocols., 5(3), e146. 3. Chong, J., Soufan, O., Li, C., Caraus, I., Li, S., Bourque, G. (2018) MetaboAnalyst 4.0: towards more transparent and integrative metabolomics analysis. Nucleic Acids Research., 46(W1), W486-94. 4. Considine, E. C., Thomas, G., Boulesteix, A. L., Khashan, A. S., & Kenny, L. C. (2018). Critical review of reporting of the data analysis step in metabolomics. Metabolomics., 14, 7 5. Davidson, RL., Weber, RJ., Liu, H., Sharma-Oates, A., Viant MR. (2016) Galaxy-M: A Galaxy workflow for processing and analyzing direct infusion and liquid chromatography mass spectrometry-based metabolomics data. GigaScience, 23(5), 10.
|
|