Abstract
Statistical data mining (DM) and machine learning (ML) are promising tools for analyzing complex datasets. In recent decades, plant phenomics has become crucial to precision agriculture, where it supports high-throughput phenotyping of local crop cultivars. An integrated or new analytical approach is therefore needed to handle these phenomics data. We propose a statistical framework for the analysis of phenomics data that integrates DM and ML methods. To validate the proposed approach, four popular supervised ML methods, Linear Discriminant Analysis (LDA), Random Forest (RF), and Support Vector Machine with linear (SVM-l) and radial basis (SVM-r) kernels, are used to classify and predict plant status (stress/non-stress). Several simulated and real plant phenotype datasets were analyzed. The results show that the features selected by the proposed approach contribute significantly throughout the analysis. In this study, we show that the proposed approach simplifies phenotype data analysis, reduces the computational time of ML algorithms, and increases prediction accuracy.
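As an illustration only, the comparison of the four classifiers on selected features can be sketched as follows. This is a minimal sketch under stated assumptions, not the framework of the paper: the abstract does not specify the feature-selection step, so a univariate ANOVA F-test (`SelectKBest` with `f_classif` from scikit-learn) is swapped in as a stand-in, and the data are synthetic (`make_classification`) with binary labels standing in for stress/non-stress. The function name `compare_classifiers` and all parameter values are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def compare_classifiers(n_selected=5, seed=0):
    """Cross-validated accuracy of LDA, RF, SVM-l and SVM-r on selected features.

    Assumption: synthetic data replace real phenotype measurements, and
    SelectKBest replaces the (unspecified) feature selection of the paper.
    """
    # Synthetic two-class data: 200 samples, 40 traits, 5 of them informative.
    X, y = make_classification(
        n_samples=200,
        n_features=40,
        n_informative=5,
        n_redundant=5,
        random_state=seed,
    )

    models = {
        "LDA": LinearDiscriminantAnalysis(),
        "RF": RandomForestClassifier(n_estimators=100, random_state=seed),
        "SVM-l": SVC(kernel="linear"),
        "SVM-r": SVC(kernel="rbf"),
    }

    results = {}
    for name, model in models.items():
        # Scaling and feature selection sit inside the pipeline so they are
        # refit on each training fold, which avoids leaking test-fold labels.
        pipeline = make_pipeline(
            StandardScaler(),
            SelectKBest(f_classif, k=n_selected),
            model,
        )
        scores = cross_val_score(pipeline, X, y, cv=5)
        results[name] = scores.mean()
    return results


if __name__ == "__main__":
    for name, accuracy in compare_classifiers().items():
        print(f"{name}: mean 5-fold accuracy = {accuracy:.3f}")
```

Keeping the selection step inside the cross-validation pipeline is a design choice of this sketch: selecting features on the full dataset before splitting would make the reported accuracies optimistic.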
Publisher
Springer Science and Business Media LLC