HaploCatcher: An R Package for Prediction of Haplotypes

Author:

Winn Zachary James, Hudson-Arns Emily, Hammers Mikayla, DeWitt Noah, Lyerly Jeanette, Bai Guihua, St. Amand Paul, Nachappa Punya, Haley Scott, Mason Richard Esten

Abstract

Wheat (Triticum aestivum L.) is crucial to global food security, but is often threatened by diseases, pests, and environmental stresses. Wheat stem sawfly (Cephus cinctus Norton) poses a major threat to food security in the United States, and solid-stem varieties, which carry the stem-solidness locus (Sst1), are the main source of genetic resistance against sawfly. Marker-assisted selection uses molecular markers to identify lines possessing beneficial haplotypes, like that of the Sst1 locus. In this study, an R package titled "HaploCatcher" was developed to predict specific haplotypes of interest in genome-wide genotyped lines. A training population of 1,056 lines, genotyped for both the Sst1 locus (known to confer stem solidness) and genome-wide markers, was curated to predict the Sst1 haplotypes of 292 lines from the Colorado State University wheat breeding program. Predicted Sst1 haplotypes were compared to marker-derived haplotypes. Our results indicated that the training set was substantially predictive, with kappa scores of 0.83 for k-nearest neighbors and 0.88 for random forest models. Forward validation on newly developed breeding lines demonstrated that a random forest model, trained on the total available training data, had comparable accuracy between forward and cross-validation. Estimated group means of lines classified by haplotypes from PCR-derived markers and predictive modeling did not significantly differ.
The HaploCatcher package is freely available and may be utilized by breeding programs, using their own training populations, to predict haplotypes for whole genome sequenced early generation material.

Core Ideas

Identification, introgression, and frequency increase of large effect loci are important for cultivar development.
The Sst1 locus has a significant effect on cutting score in fields exposed to sawfly infestation.
Historical genetic information can be utilized to predict haplotypes for lines which have genome-wide genetic data.
An R package, HaploCatcher, has been developed to facilitate this analysis in other programs.
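
The abstract reports agreement between predicted and marker-derived haplotypes as kappa scores. The sketch below is not HaploCatcher code and does not use its API. It is a minimal Python illustration, under stated assumptions, of the two ideas involved: calling a haplotype by majority vote among the nearest training lines, and scoring the calls against PCR-derived calls with Cohen's kappa. The marker coding (0/1/2), the mismatch-count distance, the class labels, and all data are invented for illustration only.

```python
from collections import Counter


def cohen_kappa(observed, predicted):
    """Cohen's kappa: agreement between two label sequences, corrected for chance."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(observed)
    # Observed agreement: fraction of lines where both calls match.
    p_o = sum(o == p for o, p in zip(observed, predicted)) / n
    # Chance agreement: product of the marginal class counts, summed over classes.
    obs_counts = Counter(observed)
    pred_counts = Counter(predicted)
    p_e = sum(obs_counts[c] * pred_counts[c] for c in obs_counts) / (n * n)
    if p_e == 1.0:
        return 1.0  # both sequences consist of the same single class
    return (p_o - p_e) / (1.0 - p_e)


def knn_predict(train_markers, train_haplotypes, query, k=3):
    """Majority vote among the k training lines with the fewest marker mismatches."""
    # Assumption: distance is the number of mismatching marker calls.
    distances = sorted(
        (sum(a != b for a, b in zip(row, query)), i)
        for i, row in enumerate(train_markers)
    )
    votes = Counter(train_haplotypes[i] for _, i in distances[:k])
    return votes.most_common(1)[0][0]


# Toy training population (hypothetical): marker calls coded 0/1/2 per line.
train_markers = [
    [0, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 1, 0, 0],
    [2, 2, 2, 2],
    [2, 2, 1, 2],
    [2, 1, 2, 2],
]
train_haplotypes = ["solid", "solid", "solid", "hollow", "hollow", "hollow"]

# Toy test lines with hypothetical PCR-derived calls; the last one disagrees
# with its marker profile on purpose, so agreement is imperfect.
test_markers = [[0, 0, 0, 1], [2, 2, 2, 1], [0, 1, 1, 0], [2, 2, 0, 2]]
pcr_calls = ["solid", "hollow", "solid", "solid"]

predicted = [knn_predict(train_markers, train_haplotypes, q) for q in test_markers]
kappa = cohen_kappa(pcr_calls, predicted)
print(predicted, kappa)
```

With this toy data, three of four calls agree (observed agreement 0.75), chance agreement is 0.5, and kappa is therefore 0.5. Kappa is useful here because haplotype classes in a breeding population are often unbalanced, so raw accuracy alone can overstate how well a model performs.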

Publisher

Cold Spring Harbor Laboratory
