AlphaPeptDeep: A modular deep learning framework to predict peptide properties for proteomics-Reference-Cited by-同舟云学术

AlphaPeptDeep: A modular deep learning framework to predict peptide properties for proteomics

Published:2022-07-16 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Zeng Wen-Feng^ORCID,Zhou Xie-Xuan,Willems Sander^ORCID,Ammar Constantin,Wahle Maria,Bludau Isabell^ORCID,Voytik Eugenia^ORCID,Strauss Maximillian T.^ORCID,Mann Matthias^ORCID

Abstract

AbstractMachine learning and in particular deep learning (DL) are increasingly important in mass spectrometry (MS)-based proteomics. Recent DL models can predict the retention time, ion mobility and fragment intensities of a peptide just from the amino acid sequence with good accuracy. However, DL is a very rapidly developing field with new neural network architectures frequently appearing, which are challenging to incorporate for proteomics researchers. Here we introduce AlphaPeptDeep, a modular Python framework built on the PyTorch DL library that learns and predicts the properties of peptides (https://github.com/MannLabs/alphapeptdeep). It features a model shop that enables non-specialists to create models in just a few lines of code. AlphaPeptDeep represents post-translational modifications in a generic manner, even if only the chemical composition is known. Extensive use of transfer learning obviates the need for large data sets to refine models for particular experimental conditions. The AlphaPeptDeep models for predicting retention time, collisional cross sections and fragment intensities are at least on par with existing tools. Additional sequence-based properties can also be predicted by AlphaPeptDeep, as demonstrated with a novel HLA peptide prediction model to improve HLA peptide identification for data-independent acquisition.

Publisher

Cold Spring Harbor Laboratory

Reference58 articles.

1. Aebersold, R. & Mann, M. Mass-spectrometric exploration of proteome structure and function. Nature vol. 537 Preprint at https://doi.org/10.1038/nature19949 (2016).

2. The emerging role of mass spectrometry-based proteomics in drug discovery

3. Li, S. & Tang, H. Computational methods in mass spectrometry-based proteomics. in Advances in Experimental Medicine and Biology vol. 939 (2016).

4. Mann, M. , Kumar, C. , Zeng, W. F. & Strauss, M. T. Artificial intelligence for proteomics and biomarker discovery. Cell Systems vol. 12 Preprint at https://doi.org/10.1016/j.cels.2021.06.006 (2021).

5. Wen, B. et al. Deep Learning in Proteomics. Proteomics vol. 20 Preprint at https://doi.org/10.1002/pmic.201900335 (2020).

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The potential of plasma HLA peptides beyond neoepitopes;2023-09-05

2. MSBooster: improving peptide identification rates using deep learning-based features;Nature Communications;2023-07-27

3. Synchro-PASEF allows precursor-specific fragment ion extraction and interference removal in data-independent acquisition;2022-11-01

4. MSBooster: Improving Peptide Identification Rates using Deep Learning-Based Features;2022-10-21