Author:
Nagle Michael F.,Yuan Jialin,Kaur Damanpreet,Ma Cathleen,Peremyslova Ekaterina,Jiang Yuan,de Rivera Alexa Niño,Jawdy Sara,Chen Jin-Gui,Feng Kai,Yates Timothy B.,Tuskan Gerald A.,Muchero Wellington,Fuxin Li,Strauss Steven H.
Abstract
AbstractPlant regeneration is an important dimension of plant propagation, and a key step in the production of transgenic plants. However, regeneration capacity varies widely among genotypes and species, the molecular basis of which is largely unknown. While association mapping methods such as genome-wide association studies (GWAS) have long demonstrated abilities to help uncover the genetic basis of trait variation in plants, the power of these methods relies on the accuracy and scale of phenotypic data used. To enable a largescale GWAS ofin plantaregeneration in model treePopulus, we implemented a workflow involving semantic segmentation to quantify regenerating plant tissues (callus and shoot) over time. We found the resulting statistics are of highly non-normal distributions, which necessitated transformations or permutations to avoid violating assumptions of linear models used in GWAS. While transformations can lead to a loss of statistical power, we demonstrate that this can be mitigated by the application of the Augmented Rank Truncation method, or avoided altogether using the Multi-Threaded Monte Carlo SNP-set (Sequence) Kernel Association Test to compute empiricalp-values in GWAS. We report over 200 statistically supported candidate genes, with top candidates including regulators of cell adhesion, stress signaling, and hormone signaling pathways, as well as other diverse functions. We demonstrate that sensitive genetic discovery for complex developmental traits can be enabled by a workflow based on computer vision and adaptation of several statistical approaches necessitated by to the complexity of regeneration trait expression and distribution.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献