Long-read sequence assembly: a technical evaluation in barley

Author:

Mascher Martin12ORCID,Wicker Thomas3,Jenkins Jerry4ORCID,Plott Christopher4ORCID,Lux Thomas5ORCID,Koh Chu Shin6ORCID,Ens Jennifer7ORCID,Gundlach Heidrun5,Boston Lori B4ORCID,Tulpová Zuzana8,Holden Samuel9ORCID,Hernández-Pinzón Inmaculada9,Scholz Uwe1ORCID,Mayer Klaus F X5ORCID,Spannagl Manuel5ORCID,Pozniak Curtis J7ORCID,Sharpe Andrew G6ORCID,Šimková Hana8,Moscou Matthew J9ORCID,Grimwood Jane4ORCID,Schmutz Jeremy4ORCID,Stein Nils110ORCID

Affiliation:

1. Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Seeland 06466, Germany

2. German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Leipzig 04103, Germany

3. Department of Plant and Microbial Biology, University of Zürich, Zürich 8008, Switzerland

4. HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806

5. PGSB–Plant Genome and Systems Biology, Helmholtz Center Munich–German Research Center for Environmental Health, Neuherberg 85764, Germany

6. Global Institute for Food Security, University of Saskatchewan, Saskatoon SK S7N 4L8, Canada

7. Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, Saskatoon SK S7N 5A8, Canada

8. Institute of Experimental Botany of the Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc 78371, Czech Republic

9. The Sainsbury Laboratory, University of East Anglia, Norwich NR4 7UH, UK

10. Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, Göttingen 37073, Germany

Abstract

Abstract Sequence assembly of large and repeat-rich plant genomes has been challenging, requiring substantial computational resources and often several complementary sequence assembly and genome mapping approaches. The recent development of fast and accurate long-read sequencing by circular consensus sequencing (CCS) on the PacBio platform may greatly increase the scope of plant pan-genome projects. Here, we compare current long-read sequencing platforms regarding their ability to rapidly generate contiguous sequence assemblies in pan-genome studies of barley (Hordeum vulgare). Most long-read assemblies are clearly superior to the current barley reference sequence based on short-reads. Assemblies derived from accurate long reads excel in most metrics, but the CCS approach was the most cost-effective strategy for assembling tens of barley genomes. A downsampling analysis indicated that 20-fold CCS coverage can yield very good sequence assemblies, while even five-fold CCS data may capture the complete sequence of most genes. We present an updated reference genome assembly for barley with near-complete representation of the repeat-rich intergenic space. Long-read assembly can underpin the construction of accurate and complete sequences of multiple genomes of a species to build pan-genome infrastructures in Triticeae crops and their wild relatives.

Funder

German Ministry of Education and Research

SHAPE II

de.NBI

Czech Science Foundation

ERDF

ELIXIR-CZ project

The Gatsby Charitable Foundation

DTP

Genome Canada

Canada First Research Excellence Fund

Publisher

Oxford University Press (OUP)

Subject

Cell Biology,Plant Science

Cited by 164 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3