moPepGen: Rapid and Comprehensive Identification of Non-canonical Peptides

Published: 2024-03-31

Author:

Zhu Chenghao, Liu Lydia Y, Ha Annie, Yamaguchi Takafumi N, Zhu Helen, Hugh-White Rupert, Livingstone Julie, Patel Yash, Kislinger Thomas, Boutros Paul C

Abstract

Gene expression is a multi-step transformation of biological information from its storage form (DNA) into functional forms (protein and some RNAs). Regulatory activities at each step of this transformation multiply a single gene into a myriad of proteoforms. Proteogenomics is the study of how genomic and transcriptomic variation creates this proteomic diversity, and is limited by the challenges of modeling the complexities of gene expression. We therefore created moPepGen, a graph-based algorithm that comprehensively generates non-canonical peptides in linear time. moPepGen works with multiple technologies, in multiple species and on all types of genetic and transcriptomic data. In human cancer proteomes, it enumerates previously unobservable non-canonical peptides arising from germline and somatic genomic variants, noncoding open reading frames, RNA fusions and RNA circularization. By enabling efficient detection and quantitation of previously hidden proteins in both existing and new proteomic data, moPepGen facilitates all proteogenomics applications. It is available at: https://github.com/uclahs-cds/package-moPepGen.
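
To make the term "non-canonical peptide" concrete, here is a minimal Python sketch of how amino-acid substitutions can create peptides absent from the reference digest. It is not moPepGen's algorithm or API: the abstract describes a graph-based, linear-time method, whereas this toy brute-forces every combination of variants and so scales exponentially. The function names, the substitution input format, and the simplified trypsin rule (cleave after K or R unless followed by P, no missed cleavages) are all assumptions made for illustration.

```python
from itertools import product


def tryptic_peptides(protein):
    """Simplified in-silico trypsin digest: cut after K or R unless the
    next residue is P. Missed cleavages are not modeled (assumption)."""
    peptides, start = [], 0
    for i, aa in enumerate(protein):
        is_last = i == len(protein) - 1
        if is_last or (aa in "KR" and protein[i + 1] != "P"):
            peptides.append(protein[start:i + 1])
            start = i + 1
    return peptides


def noncanonical_peptides(protein, substitutions):
    """Return peptides that appear only when variants are applied.

    `substitutions` maps a 0-based position to one alternative residue
    (a hypothetical input format chosen for this sketch).
    """
    # At each position, allow the reference residue and, if present, the variant.
    choices = [
        (aa, substitutions[i]) if i in substitutions else (aa,)
        for i, aa in enumerate(protein)
    ]
    canonical = set(tryptic_peptides(protein))
    found = set()
    # Brute force over every variant combination: exponential, illustration only.
    for residues in product(*choices):
        found.update(tryptic_peptides("".join(residues)))
    return found - canonical


# K3N removes a cleavage site; G5K introduces a new one.
print(sorted(noncanonical_peptides("MTKAGRLEK", {2: "N", 4: "K"})))
# ['AK', 'MTNAGR', 'MTNAK', 'R']
```

The example shows why variant peptides cannot be found by editing canonical peptides one at a time: a substitution can create or destroy a cleavage site, which changes peptide boundaries as well as sequences.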

Publisher

Cold Spring Harbor Laboratory
