Mining bacterial NGS data vastly expands the complete genomes of temperate phages

Author:

Zhang Xianglilan1ORCID,Wang Ruohan2ORCID,Xie Xiangcheng3,Hu Yunjia45ORCID,Wang Jianping2,Sun Qiang6,Feng Xikang7ORCID,Lin Wei4,Tong Shanwei89,Yan Wei10,Wen Huiqi14,Wang Mengyao2,Zhai Shixiang111213,Sun Cheng14,Wang Fangyi15,Niu Qi14,Kropinski Andrew M16,Cui Yujun1,Jiang Xiaofang10ORCID,Peng Shaoliang14,Li Shuaicheng2ORCID,Tong Yigang4ORCID

Affiliation:

1. State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology , Beijing  100071,  People's Republic of China

2. Department of Computer Science, City University of Hong Kong , Hong Kong  999077,  People's Republic of China

3. College of Computer, National University of Defense Technology , Changsha  410073,  People's Republic of China

4. Beijing Advanced Innovation Center for Soft Matter Science and Engineering (BAIC-SM), College of Life Science and Technology, Beijing University of Chemical Technology , Beijing  100029,  People's Republic of China

5. School of Medicine, Shanghai University , Shanghai  200444,  People's Republic of China

6. The 964th Hospital , Changchun  130021,  People's Republic of China

7. School of Software, Northwestern Polytechnical University , Xi’an  710072,  People's Republic of China

8. Bioinformatics Graduate Program, University of British Columbia , Vancouver BC  V6T 1Z4,  Canada

9. Faculty of Health Sciences, Simon Fraser University , Burnaby ,  BC  V5A 1S6, Canada

10. National Library of Medicine, National Institutes of Health , Bethesda ,  MD  20894, USA

11. Yantai Institute of Coastal Zone Research, Chinese Academy of Sciences , Yantai  264003,  People's Republic of China

12. University of Chinese Academy of Sciences , Beijing  100049,  People's Republic of China

13. Center for Ocean Mega-Science, Chinese Academy of Sciences , Qingdao  266071,  People's Republic of China

14. School of Computer Science and Electronic Engineering, Hunan University , Changsha  410082,  People's Republic of China

15. Department of Statistics, the Ohio State University , Columbus, OH  43210,  USA

16. Departments of Food Science, and Pathobiology, University of Guelph ,  Guelph ,  ON N1G 2W1 , Canada

Abstract

Abstract Temperate phages (active prophages induced from bacteria) help control pathogenicity, modulate community structure, and maintain gut homeostasis. Complete phage genome sequences are indispensable for understanding phage biology. Traditional plaque techniques are inapplicable to temperate phages due to their lysogenicity, curbing their identification and characterization. Existing bioinformatics tools for prophage prediction usually fail to detect accurate and complete temperate phage genomes. This study proposes a novel computational temperate phage detection method (TemPhD) mining both the integrated active prophages and their spontaneously induced forms (temperate phages) from next-generation sequencing raw data. Applying the method to the available dataset resulted in 192 326 complete temperate phage genomes with different host species, expanding the existing number of complete temperate phage genomes by more than 100-fold. The wet-lab experiments demonstrated that TemPhD can accurately determine the complete genome sequences of the temperate phages, with exact flanking sites, outperforming other state-of-the-art prophage prediction methods. Our analysis indicates that temperate phages are likely to function in the microbial evolution by (i) cross-infecting different bacterial host species; (ii) transferring antibiotic resistance and virulence genes and (iii) interacting with hosts through restriction-modification and CRISPR/anti-CRISPR systems. This work provides a comprehensively complete temperate phage genome database and relevant information, which can serve as a valuable resource for phage research.

Funder

National Natural Science Foundation of China

National Key Research and Development Program of China

Key Research and Development Program of Hebei Province

National Library of Medicine

National Institutes of Health

Publisher

Oxford University Press (OUP)

Subject

Applied Mathematics,Computer Science Applications,Genetics,Molecular Biology,Structural Biology

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3