rox: A Statistical Model for Regression with Missing Values

Published: 2023-01-13
Volume: 13, Issue: 1, Page: 127
ISSN: 2218-1989
Journal: Metabolites
Language: en

Author:

Buyukozkan Mustafa, Benedetti Elisa, Krumsiek Jan

Abstract

High-dimensional omics datasets frequently contain missing data points, which typically occur due to concentrations below the limit of detection (LOD) of the profiling platform. The presence of such missing values significantly limits downstream statistical analysis and result interpretation. Two common techniques to deal with this issue include the removal of samples with missing values and imputation approaches that substitute the missing measurements with reasonable estimates. Both approaches, however, suffer from various shortcomings and pitfalls. In this paper, we present “rox”, a novel statistical model for the analysis of omics data with missing values without the need for imputation. The model directly incorporates missing values as “low” concentrations into the calculation. We show the superiority of rox over common approaches on simulated data and on six metabolomics datasets. Fully leveraging the information contained in LOD-based missing values, rox provides a powerful tool for the statistical analysis of omics data.
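To make the idea of treating LOD-based missing values as "low" concentrations concrete, here is a minimal sketch of a pairwise concordance measure. It is not the rox implementation, and the abstract does not specify how rox is computed. The function name `concordance_with_lod_missing`, the rule for skipping pairs, and the example data are all assumptions made for illustration.

```python
import math
from itertools import combinations


def concordance_with_lod_missing(x, y):
    """Fraction of comparable sample pairs ordered the same way in x and y.

    Hypothetical illustration, not the published rox model.
    Missing x values (None or NaN) are assumed to lie below the limit of
    detection, so they rank below every observed x value. Pairs in which
    both x values are missing, or which are tied in x or y, carry no
    ordering information and are skipped.
    """
    def is_missing(v):
        return v is None or (isinstance(v, float) and math.isnan(v))

    concordant = 0
    comparable = 0
    for (xi, yi), (xj, yj) in combinations(zip(x, y), 2):
        mi, mj = is_missing(xi), is_missing(xj)
        if (mi and mj) or yi == yj:
            continue  # no usable ordering for this pair
        if mi:
            x_order = -1  # missing xi is "low", so xi < xj
        elif mj:
            x_order = 1  # missing xj is "low", so xi > xj
        elif xi == xj:
            continue  # tie between two observed values
        else:
            x_order = 1 if xi > xj else -1
        y_order = 1 if yi > yj else -1
        comparable += 1
        if x_order == y_order:
            concordant += 1
    if comparable == 0:
        return float("nan")
    return concordant / comparable
```

With five samples of which two are missing, for example `x = [None, None, 1.2, 2.5, 3.1]` and `y = [0.1, 0.3, 0.5, 0.9, 1.4]`, this sketch compares 9 pairs. A complete-case analysis that drops the two samples with missing values would be left with only 3 pairs. That difference is the kind of information gain the abstract attributes to using LOD-based missing values directly.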

Publisher

MDPI AG

Subject

Molecular Biology; Biochemistry; Endocrinology, Diabetes and Metabolism

Link

https://www.mdpi.com/2218-1989/13/1/127/pdf

References (34 articles)

1. A comparative study of evaluating missing value imputation methods in label-free proteomics; Jin; Sci. Rep., 2021

2. Analysis of microbial compositions: A review of normalization and differential abundance analysis; Lin; NPJ Biofilms Microbiomes, 2020

3. Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies; Do; Metabolomics, 2018

4. Human metabolic individuality in biomedical and pharmaceutical research; Suhre; Nature, 2011

5. Microbiome Datasets Are Compositional: And This Is Not Optional; Gloor; Front. Microbiol., 2017

Cited by 1 article.

1. A roadmap to the molecular human linking multiomics with population traits and diabetes subtypes; Nature Communications; 2024-08-19