Accelerating High-Performance Classification of Bacterial Proteins Secreted via Non-Classical Pathways: no needing for deepness

Author:

Oliveira Luiz Gustavo de Sousa,Lanes Gabriel Chagas,Santos Anderson Rodrigues dosORCID

Abstract

AbstractUnderstanding protein secretion pathways is paramount in studying diseases caused by bacteria and their respective treatments. Most such paths must signal ways to identify secretion. However, some proteins, known as non-classical secreted proteins, do not have signaling ways. This study aims to classify such proteins from predictive machine-learning techniques. We collected a set of physical-chemical characteristics of amino acids from the AA index site, bolding known protein motifs, like hydrophobicity. We developed a six-step method (Alignment, Preliminary classification, mean outliers, two Clustering algorithms, and Random choice) to filter data from raw genomes and compose a negative dataset in contrast to a positive dataset of 141 proteins from the literature. Using a conventional Random Forest machine-learning algorithm, we obtained an accuracy of 91% on classifying non-classical secreted proteins in a validation dataset with 14 positive and 92 negative proteins - sensitivity and specificity of 91 and 86%, respectively, performance compared to state of the art for non-classical secretion classification. However, this work’s novelty resides in the fastness of executing non-CSP classification: instead of dozens of seconds to just one second considering a few dozen protein samples or only ten seconds to classify one hundred thousand proteins. Such fastness is more suitable for pan-genomic analyses than current methods without losing accuracy. Therefore, this research has shown that selecting an appropriate descriptors’ set and an expressive training dataset compensates for not using an advanced machine learning algorithm for the secretion by non-classical pathways purpose. Available athttps://github.com/santosardr/non-CSPs.

Publisher

Cold Spring Harbor Laboratory

Reference27 articles.

1. E. R. Green and J. Mecsas , “Bacterial Secretion Systems: An Overview,” Microbiology Spectrum, vol. 4, no. 1, Jan. 2016. [Online]. Available: /pmc/articles/PMC4804464//pmc/articles/PMC4804464/?report=abstracthttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804464/

2. Pathways of Protein Secretion in Eukaryotes

3. Tiny architects: biogenesis of intracellular replicative niches by bacterial pathogens

4. Bacterial secreted proteins are required for the internalization of Campylobacter jejuni into cultured mammalian cells

5. “Bacterial Virulence Factors: Secreted for Survival;Indian Journal of Microbiology,2017

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3