Knowledge-guided data mining on the standardized architecture of NRPS: Subtypes, novel motifs, and sequence entanglements-Reference-Cited by-同舟云学术

Knowledge-guided data mining on the standardized architecture of NRPS: Subtypes, novel motifs, and sequence entanglements

Published:2023-05-15 Issue:5 Volume:19 Page:e1011100
ISSN:1553-7358
Container-title:PLOS Computational Biology
language:en
Short-container-title:PLoS Comput Biol

Author:

He Ruolin^ORCID,Zhang Jinyu,Shao Yuanzhe,Gu Shaohua,Song Chen^ORCID,Qian Long^ORCID,Yin Wen-Bing^ORCID,Li Zhiyuan^ORCID

Abstract

Non-ribosomal peptide synthetase (NRPS) is a diverse family of biosynthetic enzymes for the assembly of bioactive peptides. Despite advances in microbial sequencing, the lack of a consistent standard for annotating NRPS domains and modules has made data-driven discoveries challenging. To address this, we introduced a standardized architecture for NRPS, by using known conserved motifs to partition typical domains. This motif-and-intermotif standardization allowed for systematic evaluations of sequence properties from a large number of NRPS pathways, resulting in the most comprehensive cross-kingdom C domain subtype classifications to date, as well as the discovery and experimental validation of novel conserved motifs with functional significance. Furthermore, our coevolution analysis revealed important barriers associated with re-engineering NRPSs and uncovered the entanglement between phylogeny and substrate specificity in NRPS sequences. Our findings provide a comprehensive and statistically insightful analysis of NRPS sequences, opening avenues for future data-driven discoveries.

Funder

Key Technologies Research and Development Program

National Natural Science Foundation of China

Clinical Medicine Plus X - Young Scholars Project, Peking University, the Fundamental Research Funds for the Central Universities

The Biological Resources Program, Chinese Academy of Sciences

Key Project of Frontier Science Research of Chinese Academy of Sciences

National Postdoctoral Program for Innovative Talents

Publisher

Public Library of Science (PLoS)

Subject

Computational Theory and Mathematics,Cellular and Molecular Neuroscience,Genetics,Molecular Biology,Ecology,Modeling and Simulation,Ecology, Evolution, Behavior and Systematics

Reference123 articles.

1. Atlas of nonribosomal peptide and polyketide biosynthetic pathways reveals common occurrence of nonmodular enzymes;H Wang;Proceedings of the National Academy of Sciences,2014

2. The Ecological Role of Volatile and Soluble Secondary Metabolites Produced by Soil Bacteria;O Tyc;Trends in Microbiology,2017

3. An atlas of bacterial secondary metabolite biosynthesis gene clusters;B Wei;Environmental Microbiology,2021

4. Global biogeographic sampling of bacterial secondary metabolism;Z Charlop-Powers;eLife,2015

5. High-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny;RC Edgar;bioRxiv,2022

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. From sequence to molecules: Feature sequence-based genome mining uncovers the hidden diversity of bacterial siderophore pathways;2024-08-13

2. From sequence to molecules: Feature sequence-based genome mining uncovers the hidden diversity of bacterial siderophore pathways;2024-08-13

3. From sequence to molecules: Feature sequence-based genome mining uncovers the hidden diversity of bacterial siderophore pathways;2024-05-01

4. SIDERITE: Unveiling hidden siderophore diversity in the chemical space through digital exploration;iMeta;2024-04

5. Evolution-inspired engineering of nonribosomal peptide synthetases;Science;2024-03-22