T1SEstacker: A Tri-Layer Stacking Model Effectively Predicts Bacterial Type 1 Secreted Proteins Based on C-Terminal Non-repeats-in-Toxin-Motif Sequence Features-Reference-Cited by-同舟云学术

T1SEstacker: A Tri-Layer Stacking Model Effectively Predicts Bacterial Type 1 Secreted Proteins Based on C-Terminal Non-repeats-in-Toxin-Motif Sequence Features

Published:2022-02-08 Issue: Volume:12 Page:
ISSN:1664-302X
Container-title:Frontiers in Microbiology
language:
Short-container-title:Front. Microbiol.

Author:

Chen Zewei,Zhao Ziyi,Hui Xinjie,Zhang Junya,Hu Yixue,Chen Runhong,Cai Xuxia,Hu Yueming,Wang Yejun

Abstract

Type 1 secretion systems play important roles in pathogenicity of Gram-negative bacteria. However, the substrate secretion mechanism remains largely unknown. In this research, we observed the sequence features of repeats-in-toxin (RTX) proteins, a major class of type 1 secreted effectors (T1SEs). We found striking non-RTX-motif amino acid composition patterns at the C termini, most typically exemplified by the enriched “[FLI][VAI]” at the most C-terminal two positions. Machine-learning models, including deep-learning ones, were trained using these sequence-based non-RTX-motif features and further combined into a tri-layer stacking model, T1SEstacker, which predicted the RTX proteins accurately, with a fivefold cross-validated sensitivity of ∼0.89 at the specificity of ∼0.94. Besides substrates with RTX motifs, T1SEstacker can also well distinguish non-RTX-motif T1SEs, further suggesting their potential existence of common secretion signals. T1SEstacker was applied to predict T1SEs from the genomes of representative Salmonella strains, and we found that both the number and composition of T1SEs varied among strains. The number of T1SEs is estimated to reach 100 or more in each strain, much larger than what we expected. In summary, we made comprehensive sequence analysis on the type 1 secreted RTX proteins, identified common sequence-based features at the C termini, and developed a stacking model that can predict type 1 secreted proteins accurately.

Publisher

Frontiers Media SA

Subject

Microbiology (medical),Microbiology

Reference44 articles.

1. Structure, assembly, and function of tripartite efflux and type 1 secretion systems in gram-negative bacteria.;Alav;Chem. Rev.,2021

2. SignalP 5.0 improves signal peptide predictions using deep neural networks.;Almagro Armenteros;Nat. Biotechnol.,2019

3. The giant adhesin SiiE of Salmonella enterica.;Barlag;Molecules,2015

4. Structural features of the Pseudomonas fluorescens biofilm adhesin LapA required for LapG-dependent cleavage, biofilm formation, and cell surface localization.;Boyd;J. Bacteriol.,2014

5. Type I secretion in gram-negative bacteria.;Delepelaire;Biochim. Biophys. Acta,2004

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comprehensive Genomic Analysis Reveals Extensive Diversity of Type I and Type IV Secretion Systems in Klebsiella pneumoniae;Current Microbiology;2023-07-05

2. Title: Toleration of Frameshift Mutations in mRNA Sequences Encoding the N-terminal Peptides of Bacterial Type III Effectors;2023-04-10

3. DeepSecE: A Deep-Learning-Based Framework for Multiclass Prediction of Secreted Proteins in Gram-Negative Bacteria;Research;2023-01

4. Redefining the bacterial Type I protein secretion system;Advances in Microbial Physiology;2023