Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C

Author:

Field Matt A12ORCID,Rosen Benjamin D3ORCID,Dudchenko Olga456ORCID,Chan Eva K F78ORCID,Minoche Andre E79,Edwards Richard J10ORCID,Barton Kirston78ORCID,Lyons Ruth J7,Tuipulotu Daniel Enosi10ORCID,Hayes Vanessa M7811ORCID,D. Omer Arina45,Colaric Zane45,Keilwagen Jens12ORCID,Skvortsova Ksenia7ORCID,Bogdanovic Ozren710ORCID,Smith Martin A78,Aiden Erez Lieberman4561314ORCID,Smith Timothy P L15,Zammit Robert A16ORCID,Ballard J William O10ORCID

Affiliation:

1. Centre for Tropical Bioinformatics and Molecular Biology, Australian Institute of Tropical Health and Medicine, James Cook University, Smithfield Road, Cairns, QLD 4878, Australia

2. John Curtin School of Medical Research, Australian National University, Garran Rd, Canberra, ACT 2600, Australia

3. Animal Genomics and Improvement Laboratory, Agricultural Research Service USDA, Baltimore Ave, Beltsville, MD 20705, USA

4. The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Baylor Plaza, Houston, TX 77030, USA

5. Department of Computer Science, Rice University, Main St, Houston, TX 77005, USA

6. Center for Theoretical and Biological Physics, Rice University, Main St, Houston, TX 77005, USA

7. Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia

8. Faculty of Medicine, UNSW Sydney, High St, Kensington, NSW 2052, Australia

9. St Vincent’s Clinical School, University of New South Wales Sydney, Victoria Street, Darlinghurst NSW 2010, Australia

10. School of Biotechnology and Biomolecular Sciences, UNSW Sydney, High St, Kensington, NSW 2052, Australia

11. Central Clinical School, University of Sydney, Parramatta Road, Camperdown, NSW 2050, Australia

12. Julius Kühn-Institut, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany

13. Broad Institute of MIT and Harvard, Main St, Cambridge, MA 02142, USA

14. Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, ShanghaiTech University, Huaxia Middle Rd, Pudong 201210, China

15. US Meat Animal Research Center, Agricultural Research Service USDA, Rd 313, Clay Center, NE 68933, USA

16. Vineyard Veterinary Hospital, Windsor Rd, Vineyard, NSW 2765, Australia

Abstract

Abstract Background The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. Findings Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. Conclusions GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology.

Funder

National Science Foundation

Welch Foundation

U.S. Department of Agriculture

National Institutes of Health

Australian Research Council

Publisher

Oxford University Press (OUP)

Subject

Computer Science Applications,Health Informatics

Cited by 49 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3