Haplotype-Phased Genome Assembly of Virulent Phytophthora ramorum Isolate ND886 Facilitated by Long-Read Sequencing Reveals Effector Polymorphisms and Copy Number Variation

Author:

Malar C Mathu12,Yuzon Jennifer D.34,Das Subhadeep12,Das Abhishek12,Panda Arijit12,Ghosh Samrat12,Tyler Brett M.5,Kasuga Takao34,Tripathy Sucheta12ORCID

Affiliation:

1. Computational Genomics Lab, Structural Biology and Bioinformatics Division, CSIR Indian Institute of Chemical Biology, Kolkata, 700032, India

2. Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India

3. Department of Plant Pathology, University of California, Davis, CA, U.S.A.

4. USDA-ARS, Davis, CA, U.S.A.

5. Center for Genome Research and Biocomputing and Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331-7303, U.S.A.

Abstract

Phytophthora ramorum is a destructive pathogen that causes sudden oak death disease. The genome sequence of P. ramorum isolate Pr102 was previously produced, using Sanger reads, and contained 12 Mb of gaps. However, isolate Pr102 had shown reduced aggressiveness and genome abnormalities. In order to produce an improved genome assembly for P. ramorum, we performed long-read sequencing of highly aggressive P. ramorum isolate CDFA1418886 (abbreviated as ND886). We generated a 60.5-Mb assembly of the ND886 genome using the Pacific Biosciences (PacBio) sequencing platform. The assembly includes 302 primary contigs (60.2 Mb) and nine unplaced contigs (265 kb). Additionally, we found a ‘highly repetitive’ component from the PacBio unassembled unmapped reads containing tandem repeats that are not part of the 60.5-Mb genome. The overall repeat content in the primary assembly was much higher than the Pr102 Sanger version (48 versus 29%), indicating that the long reads have captured repetitive regions effectively. The 302 primary contigs were phased into 345 haplotype blocks and 222,892 phased variants, of which the longest phased block was 1,513,201 bp with 7,265 phased variants. The improved phased assembly facilitated identification of 21 and 25 Crinkler effectors and 393 and 394 RXLR effector genes from two haplotypes. Of these, 24 and 25 RXLR effectors were newly predicted from haplotypes A and B, respectively. In addition, seven new paralogs of effector Avh207 were found in contig 54, not reported earlier. Comparison of the ND886 assembly with Pr102 V1 assembly suggests that several repeat-rich smaller scaffolds within the Pr102 V1 assembly were possibly misassembled; these regions are fully encompassed now in ND886 contigs. Our analysis further reveals that Pr102 is a heterokaryon with multiple nuclear types in the sequences corresponding to contig 10 of ND886 assembly.

Publisher

Scientific Societies

Subject

Agronomy and Crop Science,General Medicine,Physiology

Cited by 22 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3