Ability of Genomic Prediction to Bi-Parent-Derived Breeding Population Using Public Data for Soybean Oil and Protein Content

Author:

Li Chenhui12ORCID,Yang Qing2,Liu Bingqiang2,Shi Xiaolei2,Liu Zhi2ORCID,Yang Chunyan2,Wang Tao3,Xiao Fuming3,Zhang Mengchen2,Shi Ainong4ORCID,Yan Long2

Affiliation:

1. College of Life Sciences, Hebei Agricultural University, Baoding 071001, China

2. Hebei Laboratory of Crop Genetics and Breeding, National Soybean Improvement Center Shijiazhuang Sub-Center, Huang-Huai-Hai Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture and Rural Affairs, Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, High-Tech Industrial Development Zone, 162 Hengshan St., Shijiazhuang 050035, China

3. Handan Academy of Agricultural Science, Handan 056001, China

4. Department of Horticulture, University of Arkansas, Fayetteville, AR 72701, USA

Abstract

Genomic selection (GS) is a marker-based selection method used to improve the genetic gain of quantitative traits in plant breeding. A large number of breeding datasets are available in the soybean database, and the application of these public datasets in GS will improve breeding efficiency and reduce time and cost. However, the most important problem to be solved is how to improve the ability of across-population prediction. The objectives of this study were to perform genomic prediction (GP) and estimate the prediction ability (PA) for seed oil and protein contents in soybean using available public datasets to predict breeding populations in current, ongoing breeding programs. In this study, six public datasets of USDA GRIN soybean germplasm accessions with available phenotypic data of seed oil and protein contents from different experimental populations and their genotypic data of single-nucleotide polymorphisms (SNPs) were used to perform GP and to predict a bi-parent-derived breeding population in our experiment. The average PA was 0.55 and 0.50 for seed oil and protein contents within the bi-parents population according to the within-population prediction; and 0.45 for oil and 0.39 for protein content when the six USDA populations were combined and employed as training sets to predict the bi-parent-derived population. The results showed that four USDA-cultivated populations can be used as a training set individually or combined to predict oil and protein contents in GS when using 800 or more USDA germplasm accessions as a training set. The smaller the genetic distance between training population and testing population, the higher the PA. The PA increased as the population size increased. In across-population prediction, no significant difference was observed in PA for oil and protein content among different models. The PA increased as the SNP number increased until a marker set consisted of 10,000 SNPs. This study provides reasonable suggestions and methods for breeders to utilize public datasets for GS. It will aid breeders in developing GS-assisted breeding strategies to develop elite soybean cultivars with high oil and protein contents.

Funder

Natural Science Foundation of Hebei Province

National Natural Science Foundation of China

Hebei Province Modern Agricultural Industry Technology System Industry Innovation Team

Hebei Province Funding Project for Introduction of Overseas Students

China Agriculture Research System of MOF and MARA

Publisher

MDPI AG

Reference69 articles.

1. Oil Mill Gazetteer Group (2024, March 23). Oil Mill Gazetteer; American Soybean Association: 2004; p. 110. Available online: https://omg-ojs-tamu.tdl.org/omg/.

2. A Population Structure and Genome-Wide Association Analysis on the USDA Soybean Germplasm Collection;Nonoy;Plant Genome,2015

3. Hwang, E.Y., Song, Q., Jia, G., Specht, J.E., Hyten, D.L., Costa, J., and Cregan, P.B. (2014). A genome-wide association study of seed protein and oil content in soybean. BMC Genom., 15.

4. RFLP mapping in soybean: Association between marker loci and variation in quantitative traits;Keim;Genetics,1990

5. Single-Nucleotide Polymorphisms in Soybean;Zhu;Genetics,2003

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3