Choosing the optimal population for a genome-wide association study: a simulation using whole-genome sequences from rice

Author:

Hamazaki KosukeORCID,Kajiya-Kanegae HiromiORCID,Yamasaki MasanoriORCID,Ebana Kaworu,Yabe ShioriORCID,Nakagawa HiroshiORCID,Iwata Hiroyoshi

Abstract

ABSTRACTA genome-wide association study (GWAS) needs to have a suitable population. The factors that affect a GWAS, e.g. population structure, sample size, and sequence analysis and field testing costs, need to be considered. Mixture populations containing subpopulations of different genetic backgrounds may be suitable populations. We conducted simulation experiments to see if a population with high genetic diversity, e.g., a diversity panel, should be added to a target population, especially when the target population harbors small genetic diversity. The target population was 112 accessions ofOryza sativasubsp.japonica, mainly developed in Japan. We combined the target population with three populations that had higher genetic diversities. These were 100indicaaccessions, 100japonicaaccessions, and 100 accessions with various genetic backgrounds. The results showed that the GWAS power with a mixture population was generally higher than with a separate population. Also, the GWAS optimal population varied depending on the fixation indexFSTof the quantitative trait nucleotide (QTN) and its polymorphism of QTN in each population. When a QTN is polymorphic in a target population, a target population combined with a higher diversity population improves the QTN detection power. InvestigatingFSTand the expected heterozygosityHeas factors influencing the detection power, we showed that SNPs with highFSTor lowHeare less likely to be detected by GWAS with mixture populations. Sequenced/genotyped germplasm collections can improve the GWAS detection power by using a subset of them with a target population.Core ideas (3-5 impact statements, 85 char max for each)- Genome-wide association studies with mixture populations are expected to improve the detection power of novel genes due to the increase of the sample size although the influence of population structure is a concern.- When a quantitative trait nucleotide (QTN) is polymorphic in a target population, a combination of the target population and a population with higher diversity than the target population improves the detection power of the QTN.- We found that the fixation index (FST) and the expected heterozygosity (He) were strongly related to the detection power of QTNs.- Germplasm collections which have been already sequenced/genotyped are useful for improving the detection power of GWAS without any addition of sequence costs by using a subset of them with a target population.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3