Abstract
AbstractAdvances in mass spectrometry (MS) have enabled high-throughput analysis of proteomes in biological systems. The state-of-the-art MS data analysis relies on database search algorithms to quantify proteins by identifying peptide-spectrum matches (PSMs), which convert mass spectra to peptide sequences. Different database search algorithms use distinct search strategies and thus may identify unique PSMs. However, no existing approaches can aggregate all user-specified database search algorithms with guaranteed control on the false discovery rate (FDR) and guaranteed increase in the identified peptides. To fill in this gap, we propose a statistical framework, Aggregation of Peptide Identification Results (APIR), that is universally compatible with all database search algorithms. Notably, under a target FDR threshold, APIR is guaranteed to identify at least as many, if not more, peptides as individual database search algorithms do. Evaluation of APIR on a complex protein standard shows that APIR outpowers individual database search algorithms and guarantees the FDR control. Real data studies show that APIR can identify disease-related proteins and post-translational modifications missed by some individual database search algorithms. Note that the APIR framework is easily extendable to aggregating discoveries made by multiple algorithms in other high-throughput biomedical data analysis, e.g., differential gene expression analysis on RNA sequencing data.
Publisher
Cold Spring Harbor Laboratory
Reference108 articles.
1. Oscar Alzate. Neuroproteomics. CRC Press, 2009.
2. Proteomic contributions to personalized cancer care;In: Molecular & Cellular Proteomics,2008
3. Proteomics reveals NNMT as a master metabolic regulator of cancerassociated fibroblasts;In: Nature,2019
4. Clinical proteomics of breast cancer reveals a novel layer of breast cancer classification;In: Cancer research,2018
5. Multidimensional separations-based shotgun proteomics;In: Chemical reviews,2007
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献