quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification

Author:

Lin Yunzhi12ORCID,Ye Chen34,Li Xingzhu2,Chen Qinyao2,Wu Ying2,Zhang Feng2,Pan Rui2,Zhang Sijia2,Chen Shuxia34,Wang Xu5,Cao Shuo56,Wang Yingzhen2,Yue Yi34,Liu Yongsheng124,Yue Junyang25ORCID

Affiliation:

1. Sichuan University College of Life Science, , Chengdu, Sichuan 610064, China

2. Anhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, China

3. Anhui Agricultural University School of Information and Computer, , Hefei, Anhui 230036, China

4. Anhui Agricultural University State Key Laboratory of Tea Plant Biology and Utilization, , Hefei, Anhui 230036, China

5. Chinese Academy of Agricultural Sciences Agricultural Genomics Institute at Shenzhen, , Shenzhen, Guangdong 518124, China

6. Huazhong Agricultural University Key Laboratory of Horticultural Plant Biology Ministry of Education, , Wuhan, Hubei 430070, China

Abstract

Abstract A high-quality genome is the basis for studies on functional, evolutionary, and comparative genomics. The majority of attention has been paid to the solution of complex chromosome structures and highly repetitive sequences, along with the emergence of a new ‘telomere-to-telomere (T2T) assembly’ era. However, the bioinformatic tools for the automatic construction and/or characterization of T2T genome are limited. Here, we developed a user-friendly web toolkit, quarTeT, which currently includes four modules: AssemblyMapper, GapFiller, TeloExplorer, and CentroMiner. First, AssemblyMapper is designed to assemble phased contigs into the chromosome-level genome by referring to a closely related genome. Then, GapFiller would endeavor to fill all unclosed gaps in a given genome with the aid of additional ultra-long sequences. Finally, TeloExplorer and CentroMiner are applied to identify candidate telomere and centromere as well as their localizations on each chromosome. These four modules can be used alone or in combination with each other for T2T genome assembly and characterization. As a case study, by adopting the entire modular functions of quarTeT, we have achieved the Actinidia chinensis genome assembly that is of a quality comparable to the reported genome Hongyang v4.0, which was assembled with the addition of manual handling. Further evaluation of CentroMiner by searching centromeres in Arabidopsis thaliana and Oryza sativa genomes showed that quarTeT is capable of identifying all the centromeric regions that have been previously detected by experimental methods. Collectively, quarTeT is an efficient toolkit for studies of large-scale T2T genomes and can be accessed at http://www.atcgn.com:8080/quarTeT/home.html without registration.

Publisher

Oxford University Press (OUP)

Subject

Horticulture,Plant Science,Genetics,Biochemistry,Biotechnology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3