An investigation of the impact of imbalance on the analysis of the US crop variety evaluation program data

Author:

Fang Zhou¹, Deng Dewayne D.², Jenkins Johnie N.², Zhou Qian M.¹

Affiliation:

1. Department of Mathematics and Statistics, Mississippi State University, Mississippi State, Mississippi, USA

2. USDA‐ARS, Genetics and Sustainable Agriculture Research Unit, Crop Science Research Laboratory, Mississippi State, Mississippi, USA

Abstract

Multi‐environment trial data from many crop variety evaluation programs are imbalanced because only a subset of varieties is selected for the following year, so that many variety‐by‐year combinations are missing. Inspired by the US National Cotton Variety Test trial, we conducted new simulation studies to investigate selection processes that differ from those in the existing literature. Our four main contributions are as follows. First, we adopted a framework that uses logistic regression to generate imbalanced data that are missing completely at random (MCAR), missing at random (MAR), or missing not at random (MNAR). Second, our selection process can depend on multiple traits, whereas all existing studies used only a single trait for selection. Third, besides variance components (VCs), long‐term trends that reflect genetic and non‐genetic development are of interest, since the simulated data span over 30 years. Last, we evaluated the prediction accuracy for a variety's overall and location‐specific performance. The results show that estimation of the VCs and long‐term trends is worst under MNAR when a single trait is used for selection. Compared to VC estimation, long‐term trend estimation is more strongly influenced by the missing mechanism and the missing rate. However, the prediction accuracy for a variety's performance is mainly driven by the missing rate and is less sensitive to the selection process. If the genetic and non‐genetic long‐term trends are ignored, both estimation and prediction deteriorate. More testing years would improve estimation and prediction, despite a higher missing rate.
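To make the logistic-regression idea concrete, the following is a minimal Python sketch of how a missing indicator might be drawn under the three mechanisms. It is not the authors' simulation design: the function name `missing_indicator`, the trait arrays `observed` and `hidden`, and all intercepts and slopes are hypothetical choices for illustration. The sketch assumes that MCAR corresponds to zero slopes, MAR to selection on traits that are recorded in the data, and MNAR to selection on a trait that is not available to the analyst.

```python
import numpy as np


def missing_indicator(traits, intercept, coef, rng):
    """Draw one True/False missing flag per variety from a logistic model.

    traits    : (n_varieties, n_traits) array that drives the selection
    intercept : scalar controlling the overall missing rate
    coef      : (n_traits,) slopes; all zeros means missingness ignores the traits
    """
    eta = intercept + traits @ np.asarray(coef, dtype=float)
    p_missing = 1.0 / (1.0 + np.exp(-eta))  # inverse logit
    return rng.random(traits.shape[0]) < p_missing


rng = np.random.default_rng(2024)
n = 5000  # hypothetical number of variety records

# Hypothetical data: two recorded traits and one trait the analyst never sees.
observed = rng.normal(size=(n, 2))
hidden = 0.6 * observed[:, [0]] + rng.normal(scale=0.8, size=(n, 1))

# MCAR: zero slopes, so every variety has the same chance of being dropped.
mcar = missing_indicator(observed, 0.0, [0.0, 0.0], rng)
# MAR, single trait: low values of the first recorded trait are dropped more often.
mar_single = missing_indicator(observed, 0.0, [-1.5, 0.0], rng)
# MAR, multiple traits: both recorded traits enter the selection.
mar_multi = missing_indicator(observed, 0.0, [-1.0, -1.0], rng)
# MNAR (assumed here): selection depends on the unrecorded trait.
mnar = missing_indicator(hidden, 0.0, [-1.5], rng)

for name, flag in [("MCAR", mcar), ("MAR single", mar_single),
                   ("MAR multi", mar_multi), ("MNAR", mnar)]:
    print(name, "missing rate:", round(float(flag.mean()), 3))
```

In this sketch the intercept shifts the overall missing rate, while the slopes determine how strongly each trait drives the selection. Negative slopes mean that poorly performing varieties are more likely to be dropped.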

Publisher

Wiley
