Abstract
Background
Missing values are a key issue in the statistical analysis of proteomic data. Defining the strategy to address missing values is a complex task in each study, potentially affecting the quality of statistical analyses.
Results
We have developed OptiMissP, a dashboard to visually and qualitatively evaluate missingness and guide decision making in the handling of missing values in proteomics studies that use data-independent acquisition mass spectrometry. It provides a set of visual tools to retrieve information about missingness through protein densities and topology-based approaches, and facilitates exploration of different imputation methods and missingness thresholds.
Conclusions
OptiMissP provides support for researchers’ and clinicians’ qualitative assessment of missingness in proteomic datasets in order to define study-specific strategies for the handling of missing values. OptiMissP considers biases in protein distributions related to the choice of imputation method and helps analysts to balance the information loss caused by low missingness thresholds and the noise introduced by selecting high missingness thresholds. This is complemented by topological data analysis which provides additional insight to the structure of the data and their missingness. We use an example in Chronic Kidney Disease to illustrate the main functionalities of OptiMissP.
Funder
Medical Research Council
CRUK Manchester Centre
NIHR Manchester Biomedical Research Centre
Publisher
Public Library of Science (PLoS)
Reference27 articles.
1. SWATH mass spectrometry as a tool for quantitative profiling of the matrisome;L Krasny;J Proteomics,2018
2. Protein quantitation using iTRAQ: Review on the sources of variations and analysis of nonrandom missingness;R Luo;Stat Interface,2012
3. Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data;R Wei;Sci Rep,2018
4. Normalization and missing value imputation for label-free LC-MS analysis;Y V. Karpievitch;BMC Bioinformatics,2012
5. GMSimpute: A generalized two-step Lasso approach to impute missing values in label-free mass spectrum analysis;Q Li;Bioinformatics,2020
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献