Cost-effective hybrid long-short read assembly delineates alternative GC-richStreptomyceschassis for natural product discovery

Author:

Heng Elena,Tan Lee Ling,Tay Dillon W. P.,Lim Yee Hwee,Yang Lay-Kien,Seow Deborah C.S.,Leong Chung Yan,Ng Veronica,Ng Siew Bee,Kanagasundaram Yoganathan,Wong Fong TianORCID,Koduru LokanandORCID

Abstract

AbstractWith the advent of rapid automatedin silicoidentification of biosynthetic gene clusters (BGCs), genomics presents vast opportunities to accelerate natural product (NP) discovery. However, prolific NP producers,Streptomyces, are exceptionally GC-rich (>80%) and highly repetitive within BGCs. These pose challenges in sequencing and high-quality genome assembly which are currently circumventedviaintensive sequencing. Here, we outline a more cost-effective workflow using multiplex Illumina and Oxford Nanopore sequencing with hybrid long-short read assembly algorithms to generate high quality genomes. Our protocol involves subjecting long read-derived assemblies to up to 4 rounds of polishing with short reads to yield accurate BGC predictions. We successfully sequenced and assembled 8 GC-richStreptomycesgenomes whose lengths range from 7.1 to 12.1 Mb at an average N50 of 5.9 Mb. Taxonomic analysis revealed previous misrepresentation among these strains and allowed us to propose a potentially new species,Streptomyces sydneybrenneri. Further comprehensive characterization of their biosynthetic, pan-genomic and antibiotic resistance features especially for molecules derived from type I polyketide synthase (PKS) BGCs reflected their potential as NP chassis. Thus, the genome assemblies and insights presented here are envisioned to serve as gateway for the scientific community to expand their avenues in NP discovery.Graphic abstractSchematic of hybrid long- and short read assembly workflow for genome sequencing of GC-richStreptomyces. Boxes shaded blue and grey correspond to experimental andin silicoworkflows, respectively.HighlightsA cost-effective genome sequencing approach for GC-richStreptomycesis presentedHybrid assembly improves BGC annotation and identificationA new species,Streptomyces sydneybrenneri, identified by taxonomic analysisGenomes of 8Streptomycesspecies are reported and analysed in this study

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3