Trait association and prediction through integrative k‐mer analysis

Author:

He Cheng1,Washburn Jacob D.2,Schleif Nathaniel3,Hao Yangfan1,Kaeppler Heidi3ORCID,Kaeppler Shawn M.3,Zhang Zhiwu4,Yang Jinliang56ORCID,Liu Sanzhen1ORCID

Affiliation:

1. Department of Plant Pathology Kansas State University Manhattan Kansas 66506 USA

2. Plant Genetics Research Unit USDA‐ARS Columbia Missouri 65211 USA

3. Department of Agronomy University of Wisconsin‐Madison Madison Wisconsin 53706 USA

4. Department of Crop and Soil Sciences Washington State University Pullman Washington 99164 USA

5. Department of Agronomy and Horticulture University of Nebraska‐Lincoln Lincoln Nebraska 68583‐0915 USA

6. Center for Plant Science Innovation University of Nebraska‐Lincoln Lincoln Nebraska 68583 USA

Abstract

SUMMARYGenome‐wide association study (GWAS) with single nucleotide polymorphisms (SNPs) has been widely used to explore genetic controls of phenotypic traits. Alternatively, GWAS can use counts of substrings of length k from longer sequencing reads, k‐mers, as genotyping data. Using maize cob and kernel color traits, we demonstrated that k‐mer GWAS can effectively identify associated k‐mers. Co‐expression analysis of kernel color k‐mers and genes directly found k‐mers from known causal genes. Analyzing complex traits of kernel oil and leaf angle resulted in k‐mers from both known and candidate genes. A gene encoding a MADS transcription factor was functionally validated by showing that ectopic expression of the gene led to less upright leaves. Evolution analysis revealed most k‐mers positively correlated with kernel oil were strongly selected against in maize populations, while most k‐mers for upright leaf angle were positively selected. In addition, genomic prediction of kernel oil, leaf angle, and flowering time using k‐mer data resulted in a similarly high prediction accuracy to the standard SNP‐based method. Collectively, we showed k‐mer GWAS is a powerful approach for identifying trait‐associated genetic elements. Further, our results demonstrated the bridging role of k‐mers for data integration and functional gene discovery.

Funder

Division of Integrative Organismal Systems

National Institute of Food and Agriculture

Basic Energy Sciences

Publisher

Wiley

Reference87 articles.

1. The genetic relationship between proteins, oil and grain yield in some maize hybrids;Abou‐Deif M.H.;World Journal of Agricultural Sciences,2012

2. Determining the quality and complexity of next‐generation sequencing data without a reference genome;Anvar S.Y.;Genome Biology,2014

3. Selection for leaf angle in maize and its effect on grain yield and other characters;Ariyanayagam R.P.;Crop Science,1974

4. Genetic loci controlling carotenoid biosynthesis in diverse tropical maize lines;Azmach G.;G3,2018

5. A tutorial on statistical methods for population association studies;Balding D.J.;Nature Reviews Genetics,2006

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3