Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non‐additive variation for key traits

Author:

Chen Chensong1ORCID,Powell Owen1ORCID,Dinglasan Eric1ORCID,Ross Elizabeth M.1ORCID,Yadav Seema1ORCID,Wei Xianming2,Atkin Felicity3,Deomano Emily4,Hayes Ben J.1ORCID

Affiliation:

1. Queensland Alliance for Agriculture and Food Innovation University of Queensland Queensland Australia

2. Sugar Research Australia Mackay Australia

3. Sugar Research Australia Gordonvale Australia

4. Sugar Research Australia Indooroopilly Australia

Abstract

AbstractSugarcane has a complex, highly polyploid genome with multi‐species ancestry. Additive models for genomic prediction of clonal performance might not capture interactions between genes and alleles from different ploidies and ancestral species. As such, genomic prediction in sugarcane presents an interesting case for machine learning (ML) methods, which are purportedly able to deal with high levels of complexity in prediction. Here, we investigated deep learning (DL) neural networks, including multilayer networks (MLP) and convolution neural networks (CNN), and an ensemble machine learning approach, random forest (RF), for genomic prediction in sugarcane. The data set used was 2912 sugarcane clones, scored for 26,086 genome wide single nucleotide polymorphism markers, with final assessment trial data for total cane harvested (TCH), commercial cane sugar (CCS), and fiber content (Fiber). The clones in the latest trial (2017) were used as a validation set. We compared prediction accuracy of these methods to genomic best linear unbiased prediction (GBLUP) extended to include dominance and epistatic effects. The prediction accuracies from GBLUP models were up to 0.37 for TCH, 0.43 for CCS, and 0.48 for Fiber, while the optimized ML models had prediction accuracies of 0.35 for TCH, 0.38 for CCS, and 0.48 for Fiber. Both RF and DL neural network models have comparable predictive ability with the additive GBLUP model but are less accurate than the extended GBLUP model.

Funder

Sugar Research Australia

Publisher

Wiley

Subject

Plant Science,Agronomy and Crop Science,Genetics

Reference78 articles.

1. Abadi M. Agarwal A. Barham P. Brevdo E. Chen Z. Citro C. Corrado G. S. Davis A. Dean J. &Devin M.(2016).Tensorflow: Large‐scale machine learning on heterogeneous distributed systems.arXiv preprint arXiv:1603.04467.

2. Generation of a 345K sugarcane SNP chip;Aitken K.;Proceedings of the Australian Society of Sugar Cane Technologists,2016

3. Genomic Prediction for Grain Yield and Yield-Related Traits in Chinese Winter Wheat

4. Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3