AHLF: ad hoc learning of peptide fragmentation from mass spectra enables an interpretable detection of phosphorylated and cross-linked peptides

Published: 2020-05-21

Author:

Altenburg Tom, Giese Sven, Wang Shengbo, Muth Thilo, Renard Bernhard Y.

Abstract

Mass spectrometry-based proteomics provides a holistic snapshot of the entire protein set of a living cell on a molecular level. Currently, only a few deep learning approaches that involve peptide fragmentation spectra, which represent partial sequence information of proteins, exist. Commonly, these approaches lack the ability to characterize less studied or even unknown patterns in spectra because of their use of explicit domain knowledge. To elevate unrestricted learning from spectra, we introduce AHLF, a deep learning model that is end-to-end trained on 19.2 million spectra from multiple phosphoproteomic data sets. AHLF is interpretable, and we show that peak-level feature importances and pairwise interactions between peaks are in line with corresponding peptide fragments. We demonstrate our approach by detecting post-translational modifications, specifically protein phosphorylation, based on only the fragmentation spectrum without a database search. AHLF increases the area under the receiver operating characteristic curve (AUC) by an average of 9.4% on recent phosphoproteomic data compared to the current state of the art on this task. To show the broad applicability of AHLF, we use transfer learning to also detect cross-linked peptides, as used in protein structure analysis, with an AUC of up to 94%. We expect our approach to directly apply to cell signaling and structural biology, which use phosphoproteomic and cross-linking data, but in principle any mass spectrometry-based study can benefit from an interpretable, end-to-end trained model like AHLF.

Availability

https://gitlab.com/dacs-hpi/ahlf

Contact

bernhard.renard@hpi.de

Publisher

Cold Spring Harbor Laboratory

References: 55 articles.

1. A community proposal to integrate proteomics activities in ELIXIR;F1000Research,2017

2. Mass-spectrometric exploration of proteome structure and function

3. Analysis and validation of proteomic data generated by tandem mass spectrometry;Nature Methods,2007

4. David Ochoa, Andrew F. Jarnuczak, Cristina Viéitez, Maja Gehre, Margaret Soucheray, André Mateus, Askar A. Kleefeldt, Anthony Hill, Luz Garcia-Alonso, Frank Stein, Nevan J. Krogan, Mikhail M. Savitski, Danielle L. Swaney, Juan A. Vizcaíno, Kyung-Min Noh, and Pedro Beltrao. The functional landscape of the human phosphoproteome. Nature Biotechnology, Dec 2019.

5. Systematic Discovery of In Vivo Phosphorylation Networks

Cited by 1 article.

1. yHydra: Deep Learning enables an Ultra Fast Open Search by Jointly Embedding MS/MS Spectra and Peptides of Mass Spectrometry-based Proteomics;2021-12-03