Improvement of Epitope Prediction Using Peptide Sequence Descriptors and Machine Learning

Published: 2019-09-05 Issue: 18 Volume: 20 Page: 4362
ISSN: 1422-0067
Container-title: International Journal of Molecular Sciences
Language: en
Short-container-title: IJMS

Author:

Munteanu Cristian R., Gestal Marcos, Martínez-Acevedo Yunuen G., Pedreira Nieves, Pazos Alejandro, Dorado Julián

Abstract

In this work, we improved a previous model for predicting new B-cell epitopes within proteomes for vaccine design. The predicted epitope activity of a queried peptide is based on its sequence and on a known reference epitope sequence under specific experimental conditions. The peptide sequences were transformed into molecular descriptors of sequence recurrence networks and combined with the experimental conditions. The new models were generated using 709,100 instances of pair descriptors for query and reference peptide sequences. Perturbations of the initial descriptors with respect to sequence or assay conditions yielded 10 transformed features, which served as inputs for seven machine learning methods. The best model was obtained with a random forest classifier, with an Area Under the Receiver Operating Characteristic curve (AUROC) of 0.981 ± 0.0005 for the external validation series (five-fold cross-validation). The database included information about 83,683 peptide sequences, 1448 epitope organisms, 323 host organisms, 15 types of in vivo processes, 28 experimental techniques, and 505 adjuvant additives. The current model could improve the in silico predictions of epitopes for vaccine design. The script and results are available in a free repository.
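To illustrate how a figure such as "AUROC of 0.981 ± 0.0005" over five folds can be summarized, here is a minimal Python sketch. It is not the authors' script: the function names `auroc` and `summarize_folds` and the toy folds are hypothetical, and reading the "±" as the sample standard deviation across folds is an assumption.

```python
from statistics import mean, stdev


def auroc(labels, scores):
    """AUROC as the probability that a random positive outranks a random negative.

    A tie between a positive and a negative score counts as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def summarize_folds(fold_results):
    """Return (mean, sample standard deviation) of the per-fold AUROC values."""
    values = [auroc(labels, scores) for labels, scores in fold_results]
    return mean(values), stdev(values)


# Hypothetical toy folds: (true labels, predicted epitope scores) per fold.
toy_folds = [
    ([1, 1, 0, 0], [0.9, 0.8, 0.3, 0.1]),
    ([1, 0, 1, 0], [0.7, 0.6, 0.4, 0.2]),
    ([1, 1, 0, 0], [0.6, 0.9, 0.2, 0.4]),
    ([0, 1, 0, 1], [0.5, 0.8, 0.1, 0.5]),
    ([1, 0, 0, 1], [0.9, 0.3, 0.2, 0.7]),
]

fold_mean, fold_sd = summarize_folds(toy_folds)
print(f"AUROC = {fold_mean:.3f} ± {fold_sd:.3f}")
```

The pairwise definition is quadratic in the number of samples, so it suits a small illustration only; for a dataset of the size reported here, a rank-based computation or a library routine would be the practical choice.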

Funder

Instituto de Salud Carlos III

Drug Discovery Galician Network

Basque government

Publisher

MDPI AG

Subject

Inorganic Chemistry, Organic Chemistry, Physical and Theoretical Chemistry, Computer Science Applications, Spectroscopy, Molecular Biology, General Medicine, Catalysis

Link

https://www.mdpi.com/1422-0067/20/18/4362/pdf

References: 31 articles.

1. Proteomics data mining

2. T-cell epitope vaccine design by immunoinformatics

3. Performance of two Bm86 antigen vaccin formulation against tick using crossbreed bovines in stall test;Andreotti;Rev. Bras. Parasitol. Vet.,2006

4. High level expression of the B. microplus Bm86 antigen in the yeast Pichia pastoris forming highly immunogenic particles for cattle

Cited by 6 articles.

1. Ensemble Modeling of Epitopes and Non-Epitopes Prediction from Protein Sequences of Zika and Dengue Viruses;2024 IEEE International Students' Conference on Electrical, Electronics and Computer Science (SCEECS);2024-02-24

2. Immunotherapy and targeted therapy for cholangiocarcinoma: Artificial intelligence research in imaging;Critical Reviews in Oncology/Hematology;2024-02

3. Digital Innovation Enabled Nanomaterial Manufacturing; Machine Learning Strategies and Green Perspectives;Nanomaterials;2022-08-01

4. A Deep Learning Approach with Data Augmentation to Predict Novel Spider Neurotoxic Peptides;International Journal of Molecular Sciences;2021-11-13

5. Modeling Human Innate Immune Response Using Graph Neural Networks;IEEE Access;2021