In-Pero: Exploiting Deep Learning Embeddings of Protein Sequences to Predict the Localisation of Peroxisomal Proteins-Reference-Cited by-同舟云学术

In-Pero: Exploiting Deep Learning Embeddings of Protein Sequences to Predict the Localisation of Peroxisomal Proteins

Published:2021-06-15 Issue:12 Volume:22 Page:6409
ISSN:1422-0067
Container-title:International Journal of Molecular Sciences
language:en
Short-container-title:IJMS

Author:

Anteghini Marco^ORCID,Martins dos Santos Vitor^ORCID,Saccenti Edoardo^ORCID

Abstract

Peroxisomes are ubiquitous membrane-bound organelles, and aberrant localisation of peroxisomal proteins contributes to the pathogenesis of several disorders. Many computational methods focus on assigning protein sequences to subcellular compartments, but there are no specific tools tailored for the sub-localisation (matrix vs. membrane) of peroxisome proteins. We present here In-Pero, a new method for predicting protein sub-peroxisomal cellular localisation. In-Pero combines standard machine learning approaches with recently proposed multi-dimensional deep-learning representations of the protein amino-acid sequence. It showed a classification accuracy above 0.9 in predicting peroxisomal matrix and membrane proteins. The method is trained and tested using a double cross-validation approach on a curated data set comprising 160 peroxisomal proteins with experimental evidence for sub-peroxisomal localisation. We further show that the proposed approach can be easily adapted (In-Mito) to the prediction of mitochondrial protein localisation obtaining performances for certain classes of proteins (matrix and inner-membrane) superior to existing tools.

Funder

H2020 Marie Skłodowska-Curie Actions

Publisher

MDPI AG

Subject

Inorganic Chemistry,Organic Chemistry,Physical and Theoretical Chemistry,Computer Science Applications,Spectroscopy,Molecular Biology,General Medicine,Catalysis

Link

https://www.mdpi.com/1422-0067/22/12/6409/pdf

Reference57 articles.

1. Alzheimer's Disease βA4 Protein Release and Amyloid Precursor Protein Sorting Are Regulated by Alternative Splicing

2. Localization and Post-Golgi Trafficking of Tumor Necrosis Factor-alpha in Macrophages

3. The ins and outs of E-cadherin trafficking

4. Adaptation of protein surfaces to subcellular location 1 1Edited by F. E. Cohen

5. Discrimination of Intracellular and Extracellular Proteins Using Amino Acid Composition and Residue-pair Frequencies

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Protein subcellular localization prediction tools;Computational and Structural Biotechnology Journal;2024-12

2. PEL-PVP: Application of plant vacuolar protein discriminator based on PEFT ESM-2 and bilayer LSTM in an unbalanced dataset;International Journal of Biological Macromolecules;2024-10

3. SCLpred-ECL: Subcellular Localization Prediction by Deep N-to-1 Convolutional Neural Networks;International Journal of Molecular Sciences;2024-05-16

4. Protein sequence analysis in the context of drug repurposing;BMC Medical Informatics and Decision Making;2024-05-13

5. Prediction of Protein Localization;Reference Module in Life Sciences;2024