Assessment of label-free quantification and missing value imputation for proteomics in non-human primates

Published:2022-07-08 Issue:1 Volume:23 Page:
ISSN:1471-2164
Container-title:BMC Genomics
language:en
Short-container-title:BMC Genomics

Author:

Hamid Zeeshan, Zimmerman Kip D., Guillen-Ahlers Hector, Li Cun, Nathanielsz Peter, Cox Laura A., Olivier Michael

Abstract

Background: Reliable and effective label-free quantification (LFQ) analyses depend not only on the method of data acquisition in the mass spectrometer, but also on the downstream data processing, including software tools, query database, data normalization, and imputation. In non-human primates (NHP), LFQ is challenging because the query databases for NHP are limited: the genomes of these species are not comprehensively annotated. This results in limited discovery of proteins and associated post-translational modifications (PTMs) and a higher fraction of missing data points. Identifying fewer proteins and PTMs because of database limitations can hide important and meaningful biological information. Missing data also limits downstream analyses (e.g., multivariate analyses), decreases statistical power, biases statistical inference, and makes biological interpretation of the data more challenging. In this study we attempted to address both issues. First, we used the MetaMorpheus proteomics search engine to counter the limits of NHP query databases and maximize the discovery of proteins and associated PTMs. Second, we evaluated different imputation methods for accurate data inference. We used a generic approach for missing data imputation analysis without distinguishing the potential source of missing data (either non-assigned m/z or missing values across runs).

Results: Using the MetaMorpheus proteomics search engine, we obtained quantitative data for 1622 proteins and 10,634 peptides, including 58 different PTMs (biological, metal, and artifacts), across a diverse age range of NHP brain frontal cortex. However, of the 1622 proteins identified, only 293 were quantified across all samples with no missing values, emphasizing the importance of implementing an accurate and statistically valid imputation method to fill in missing data. In our imputation analysis we demonstrate that single imputation methods that borrow information from correlated proteins, such as Generalized Ridge Regression (GRR), Random Forest (RF), local least squares (LLS), and Bayesian Principal Component Analysis (BPCA), are able to estimate missing protein abundance values with great accuracy.

Conclusions: Overall, this study offers a detailed comparative analysis of LFQ data generated in NHP and proposes strategies for improved LFQ in NHP proteomics data.
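The idea of imputing by "borrowing information from correlated proteins" can be illustrated with a small, self-contained sketch. It is not the authors' pipeline and it does not implement GRR, RF, LLS, or BPCA. As a simplified stand-in, it fills each missing value with a one-predictor least-squares fit on the single most correlated protein, and compares that with per-protein mean imputation. The data are simulated, the values are masked completely at random, and every function name and parameter (`simulate`, `mask_at_random`, `impute_from_correlated`, `min_shared`, the 15% missing fraction) is a hypothetical choice made for illustration.

```python
import math
import random


def simulate(n_proteins=8, n_samples=30, noise_sd=0.1, seed=0):
    """Simulated log-abundances driven by one shared latent factor (not real data)."""
    rng = random.Random(seed)
    latent = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    data = []
    for _ in range(n_proteins):
        loading = rng.uniform(0.5, 2.0)
        baseline = rng.uniform(18.0, 24.0)
        data.append([baseline + loading * z + rng.gauss(0.0, noise_sd) for z in latent])
    return data


def mask_at_random(data, fraction=0.15, seed=1):
    """Hide a fraction of values completely at random; None marks a missing value."""
    rng = random.Random(seed)
    return [[None if rng.random() < fraction else v for v in row] for row in data]


def mean(xs):
    return sum(xs) / len(xs)


def fit_line(x, y):
    """Ordinary least squares for y = a + b*x; returns (a, b, pearson_r)."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        return my, 0.0, 0.0
    b = sxy / sxx
    return my - b * mx, b, sxy / math.sqrt(sxx * syy)


def impute_mean(masked):
    """Baseline: replace each missing value with the protein's observed mean."""
    out = []
    for row in masked:
        m = mean([v for v in row if v is not None])
        out.append([m if v is None else v for v in row])
    return out


def impute_from_correlated(masked, min_shared=5):
    """Predict each missing value from the most correlated protein observed in that sample."""
    filled = impute_mean(masked)  # fallback when no usable predictor exists
    for p, row in enumerate(masked):
        for s, v in enumerate(row):
            if v is not None:
                continue
            best = None  # (|r|, prediction)
            for q, other in enumerate(masked):
                if q == p or other[s] is None:
                    continue
                pairs = [(o, t) for o, t in zip(other, row)
                         if o is not None and t is not None]
                if len(pairs) < min_shared:
                    continue
                a, b, r = fit_line([x for x, _ in pairs], [y for _, y in pairs])
                if best is None or abs(r) > best[0]:
                    best = (abs(r), a + b * other[s])
            if best is not None:
                filled[p][s] = best[1]
    return filled


def rmse(truth, masked, imputed):
    """Root-mean-square error over the masked positions only."""
    errs = [(imputed[p][s] - truth[p][s]) ** 2
            for p, row in enumerate(masked)
            for s, v in enumerate(row) if v is None]
    return math.sqrt(mean(errs))


truth = simulate()
masked = mask_at_random(truth)
rmse_mean = rmse(truth, masked, impute_mean(masked))
rmse_corr = rmse(truth, masked, impute_from_correlated(masked))
print(f"mean imputation RMSE: {rmse_mean:.3f}")
print(f"correlated-protein regression RMSE: {rmse_corr:.3f}")
```

Masking values whose true abundance is known and scoring the imputed values against them is one common way to compare imputation methods. Real LFQ data often contain values that are missing because of low abundance rather than at random, which this sketch does not model.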

Publisher

Springer Science and Business Media LLC

Subject

Genetics, Biotechnology

Link

https://link.springer.com/content/pdf/10.1186/s12864-022-08723-1.pdf

References: 29 articles.

1. Moulder R, Goo YA, Goodlett DR. Label-free quantitation for clinical proteomics. Methods Mol Biol. 2016;1410:65–76.

2. Filiou MD, Martins-de-Souza D, Guest PC, Bahn S, Turck CW. To label or not to label: applications of quantitative proteomics in neuroscience research. Proteomics. 2012;12(4–5):736–47.

3. Wang M, You J, Bemis KG, Tegeler TJ, Brown DP. Label-free mass spectrometry-based protein quantification technologies in proteomic analysis. Brief Funct Genomic Proteomic. 2008;7(5):329–39.

4. Proffitt JM, Glenn J, Cesnik AJ, Jadhav A, Shortreed MR, Smith LM, et al. Proteomics in non-human primates: utilizing RNA-Seq data to improve protein identification by mass spectrometry in vervet monkeys. BMC Genomics. 2017;18(1):877.

5. Lazar C, Gatto L, Ferro M, Bruley C, Burger T. Accounting for the multiple natures of missing values in label-free quantitative proteomics data sets to compare imputation strategies. J Proteome Res. 2016;15(4):1116–25.

Cited by 6 articles.

1. Proteomes of Plasmodium knowlesi early and late ring-stage parasites and infected host erythrocytes;Journal of Proteomics;2024-06

2. Proteomics—The State of the Field: The Definition and Analysis of Proteomes Should Be Based in Reality, Not Convenience;Proteomes;2024-04-19

3. Integrated multi-omics analysis of brain aging in female nonhuman primates reveals altered signaling pathways relevant to age-related disorders;Neurobiology of Aging;2023-12

4. Multi-omics Analysis of Aging Liver Reveals Changes in Endoplasmic Stress and Degradation Pathways in Female Nonhuman Primates;2023-08-22

5. Dealing with missing values in proteomics data;PROTEOMICS;2022-11-17