Origin matters: Using a local reference genome improves measures in population genomics

Author:

Thorburn Doko‐Miles J.12ORCID,Sagonas Kostas13,Binzer‐Panchal Mahesh4,Chain Frederic J. J.5ORCID,Feulner Philine G. D.67ORCID,Bornberg‐Bauer Erich8,Reusch Thorsten B. H.9,Samonte‐Padilla Irene E.10,Milinski Manfred10,Lenz Tobias L.1112,Eizaguirre Christophe1

Affiliation:

1. School of Biological and Chemical Sciences Queen Mary University of London London UK

2. Department of Life Sciences Imperial College London London UK

3. Department of Zoology, School of Biology Aristotle University of Thessaloniki Thessaloniki Greece

4. Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, National Bioinformatics Infrastructure Sweden (NBIS) Uppsala University Uppsala Sweden

5. Department of Biological Sciences University of Massachusetts Lowell Lowell Massachusetts USA

6. Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry EAWAG Swiss Federal Institute of Aquatic Science and Technology Kastanienbaum Switzerland

7. Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution University of Bern Bern Switzerland

8. Evolutionary Bioinformatics, Institute for Evolution and Biodiversity University of Münster Münster Germany

9. Marine Evolutionary Ecology, GEOMAR Helmholtz Centre for Ocean Research Kiel Germany

10. Department of Evolutionary Ecology Max Planck Institute for Evolutionary Biology Plön Germany

11. Research Group for Evolutionary Immunogenomics Max Planck Institute for Evolutionary Biology Plön Germany

12. Research Unit for Evolutionary Immunogenomics, Department of Biology University of Hamburg Hamburg Germany

Abstract

AbstractGenome sequencing enables answering fundamental questions about the genetic basis of adaptation, population structure and epigenetic mechanisms. Yet, we usually need a suitable reference genome for mapping population‐level resequencing data. In some model systems, multiple reference genomes are available, giving the challenging task of determining which reference genome best suits the data. Here, we compared the use of two different reference genomes for the three‐spined stickleback (Gasterosteus aculeatus), one novel genome derived from a European gynogenetic individual and the published reference genome of a North American individual. Specifically, we investigated the impact of using a local reference versus one generated from a distinct lineage on several common population genomics analyses. Through mapping genome resequencing data of 60 sticklebacks from across Europe and North America, we demonstrate that genetic distance among samples and the reference genomes impacts downstream analyses. Using a local reference genome increased mapping efficiency and genotyping accuracy, effectively retaining more and better data. Despite comparable distributions of the metrics generated across the genome using SNP data (i.e. π, Tajima's D and FST), window‐based statistics using different references resulted in different outlier genes and enriched gene functions. A marker‐based analysis of DNA methylation distributions had a comparably high overlap in outlier genes and functions, yet with distinct differences depending on the reference genome. Overall, our results highlight how using a local reference genome decreases reference bias to increase confidence in downstream analyses of the data. Such results have significant implications in all reference‐genome‐based population genomic analyses.

Funder

Deutsche Forschungsgemeinschaft

Publisher

Wiley

Subject

Genetics,Ecology, Evolution, Behavior and Systematics,Biotechnology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3