Highly contiguous assemblies of 101 drosophilid genomes

Author:

Kim Bernard Y1ORCID,Wang Jeremy R2ORCID,Miller Danny E3,Barmina Olga4,Delaney Emily4ORCID,Thompson Ammon4,Comeault Aaron A5ORCID,Peede David6ORCID,D'Agostino Emmanuel RR6,Pelaez Julianne7,Aguilar Jessica M7,Haji Diler7,Matsunaga Teruyuki7ORCID,Armstrong Ellie E1,Zych Molly8,Ogawa Yoshitaka9,Stamenković-Radak Marina10,Jelić Mihailo10ORCID,Veselinović Marija Savić10,Tanasković Marija11,Erić Pavle11ORCID,Gao Jian-Jun12,Katoh Takehiro K12,Toda Masanori J13ORCID,Watabe Hideaki14,Watada Masayoshi15,Davis Jeremy S16,Moyle Leonie C17,Manoli Giulia18,Bertolini Enrico18,Košťál Vladimír19,Hawley R Scott20,Takahashi Aya9ORCID,Jones Corbin D6,Price Donald K21,Whiteman Noah7ORCID,Kopp Artyom4ORCID,Matute Daniel R6,Petrov Dmitri A1ORCID

Affiliation:

1. Department of Biology, Stanford University, Stanford, United States

2. Department of Genetics, University of North Carolina, Chapel Hill, United States

3. Department of Pediatrics, Division of Genetic Medicine, University of Washington and Seattle Children’s Hospital, Seattle, United States

4. Department of Evolution and Ecology, University of California Davis, Davis, United States

5. School of Natural Sciences, Bangor University, Bangor, United Kingdom

6. Biology Department, University of North Carolina, Chapel Hill, United States

7. Department of Integrative Biology, University of California, Berkeley, Berkeley, United States

8. Molecular and Cellular Biology Program, University of Washington, Seattle, United States

9. Department of Biological Sciences, Tokyo Metropolitan University, Hachioji, Japan

10. Faculty of Biology, University of Belgrade, Belgrade, Serbia

11. University of Belgrade, Institute for Biological Research "Siniša Stanković", National Institute of Republic of Serbia, Belgrade, Serbia

12. School of Ecology and Environmental Science, Yunnan University, Kunming, China

13. Hokkaido University Museum, Hokkaido University, Sapporo, Japan

14. Biological Laboratory, Sapporo College, Hokkaido University of Education, Sapporo, Japan

15. Graduate School of Science and Engineering, Ehime University, Matsuyama, Japan

16. Department of Biology, University of Kentucky, Lexington, United States

17. Department of Biology, Indiana University, Bloomington, United States

18. Neurobiology and Genetics, Theodor Boveri Institute, Biocentre, University of Würzburg, Würzburg, Germany

19. Institute of Entomology, Biology Centre, Academy of Sciences of the Czech Republic, Prague, Czech Republic

20. Department of Molecular and Integrative Physiology, University of Kansas Medical Center, Stowers Institute for Medical Research, Kansas City, United States

21. School of Life Science, University of Nevada, Las Vegas, United States

Abstract

Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies. We show that Nanopore-based assemblies are highly accurate in coding regions, particularly with respect to coding insertions and deletions. These assemblies, along with a detailed laboratory protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution at the scale of hundreds of species.

Funder

National Institute of General Medical Sciences

National Institute of Diabetes and Digestive and Kidney Diseases

National Science Foundation

Google

Uehara Memorial Foundation

Ministry of Education, Science and Technological Development of the Republic of Serbia

National Natural Science Foundation of China

Japan Society for the Promotion of Science

Horizon 2020 - Research and Innovation Framework Programme

Czech Science Foundation

Publisher

eLife Sciences Publications, Ltd

Subject

General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience

Reference98 articles.

1. The genome sequence of Drosophila melanogaster;Adams;Science,2000

2. One fly–one genome: chromosome-scale genome assembly of a single outbred Drosophila melanogaster;Adams;Nucleic Acids Research,2020

3. Basic local alignment search tool;Altschul;Journal of Molecular Biology,1990

4. Progressive Cactus is a multiple-genome aligner for the thousand-genome era;Armstrong;Nature,2020

5. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing;Bankevich;Journal of computational biology : a journal of computational molecular cell biology,2012

Cited by 126 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3