Beam search decoder for enhancing sequence decoding speed in single-molecule peptide sequencing data
-
Published:2023-11-07
Issue:11
Volume:19
Page:e1011345
-
ISSN:1553-7358
-
Container-title:PLOS Computational Biology
-
language:en
-
Short-container-title:PLoS Comput Biol
Author:
Kipen JavierORCID,
Jaldén Joakim
Abstract
Next-generation single-molecule protein sequencing technologies have the potential to significantly accelerate biomedical research. These technologies offer sensitivity and scalability for proteomic analysis. One auspicious method is fluorosequencing, which involves: cutting naturalized proteins into peptides, attaching fluorophores to specific amino acids, and observing variations in light intensity as one amino acid is removed at a time. The original peptide is classified from the sequence of light-intensity reads, and proteins can subsequently be recognized with this information. The amino acid step removal is achieved by attaching the peptides to a wall on the C-terminal and using a process called Edman Degradation to remove an amino acid from the N-Terminal. Even though a framework (Whatprot) has been proposed for the peptide classification task, processing times remain restrictive due to the massively parallel data acquisicion system. In this paper, we propose a new beam search decoder with a novel state formulation that obtains considerably lower processing times at the expense of only a slight accuracy drop compared to Whatprot. Furthermore, we explore how our novel state formulation may lead to even faster decoders in the future.
Funder
Svenska Forskningsrådet Formas
Stiftelsen för Strategisk Forskning
Publisher
Public Library of Science (PLoS)
Subject
Computational Theory and Mathematics,Cellular and Molecular Neuroscience,Genetics,Molecular Biology,Ecology,Modeling and Simulation,Ecology, Evolution, Behavior and Systematics
Reference18 articles.
1. Eisenstein M. Seven technologies to watch in 2023; 2023. Available from: https://www.nature.com/articles/d41586-023-00178-y.
2. Paving the way to single-molecule protein sequencing;L Restrepo-Pérez;Nature Nanotechnology,2018
3. Understudied proteins: opportunities and challenges for functional proteomics;G Kustatscher;Nature Methods,2022
4. The emerging landscape of single-molecule protein sequencing technologies;JA Alfaro;Nature methods,2021
5. Single-Cell Proteomics;LF Vistain;Trends in Biochemical Sciences,2021