Morphological Traits Evaluated with Random Forest Method Explains Natural Classification of Grapevine (Vitis vinifera L.) Cultivars

Author:

Szűgyi-Reiczigel Zsófia, Ladányi Márta, Bisztray György Dénes, Varga Zsuzsanna, Bodor-Pesti Péter

Abstract

Hundreds of morphological and morphometric traits are available to classify and identify grapevine (Vitis vinifera L.) genotypes, but their statistical evaluation has limitations, especially when it is not known in advance which traits discriminate within a given sample set. A high number of investigated characters can cause redundancy, while reducing that number may result in data loss. Grapevine is one of the most important horticultural crops, with many cultivars in production, so the characterization of genotypes is of high importance. In this study, we analyzed a dataset of scientific and historical importance comprising 125 morphological traits of 97 grapevine cultivars described by Németh in 1966. With so many categorical traits and comparatively few cultivars, the traits are not independent. Therefore, the number of traits was first reduced using a simple and effective algorithm that eliminates traits with redundant information content, based on the asymmetric measure of association Goodman and Kruskal’s λ. This reduced the number of traits from 125 to 59 without any information loss. For the classification, we applied a random forest (RF) method, which correctly classified 93% of the cultivars using only four traits of the dataset. To our knowledge, only a few studies in ampelography have applied a trait elimination algorithm similar to ours; the algorithm can also be used for other biological datasets of similar structure. The classification results give a morphological explanation for several cultivars from the Carpathian Basin, a territory where all three Vitis vinifera L. geographical groups, occidentalis, orientalis and pontica, are represented. We found that the data reduction method applied in our study, which avoids information loss, resolved the interdependencies caused by redundancy and provided a suitable dataset for classifying grapevine genotypes. For example, this method may be applied successfully in traditional morphometric investigations in ampelography that are based on digital image analysis.
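The redundancy-elimination idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract names Goodman and Kruskal's λ but does not give the elimination rule, so the sketch assumes that a trait is dropped when some other retained trait predicts it perfectly, that is, when λ(Y|X) = 1. The function names `gk_lambda` and `drop_redundant`, the trait names, and the toy data are all hypothetical. The random forest step is not covered.

```python
from collections import Counter


def gk_lambda(x, y):
    """Goodman and Kruskal's lambda(Y|X): the proportional reduction in
    error when predicting the category of y from the category of x."""
    n = len(y)
    modal_y = max(Counter(y).values())  # best guess for y without knowing x
    if modal_y == n:
        # Lambda is formally undefined for a constant trait. Assumption of
        # this sketch: treat it as fully predictable, since it carries no
        # discriminating information.
        return 1.0
    # For each category of x, count the most frequent category of y.
    best_per_x = {}
    for (xv, _), count in Counter(zip(x, y)).items():
        best_per_x[xv] = max(best_per_x.get(xv, 0), count)
    return (sum(best_per_x.values()) - modal_y) / (n - modal_y)


def drop_redundant(traits):
    """Greedy elimination (assumed rule): drop a trait when another trait
    that is still retained determines it completely (lambda == 1)."""
    kept = list(traits)
    for name in list(traits):
        others = [k for k in kept if k != name]
        if any(gk_lambda(traits[o], traits[name]) == 1.0 for o in others):
            kept.remove(name)
    return kept


# Hypothetical toy data: six cultivars scored on three categorical traits.
traits = {
    "leaf_lobes": ["3", "5", "5", "7", "3", "5"],
    # Coarser recoding of leaf_lobes, so it adds no information.
    "lobes_coarse": ["few", "many", "many", "many", "few", "many"],
    "berry_color": ["blue", "white", "white", "blue", "white", "blue"],
}

print(drop_redundant(traits))  # ['leaf_lobes', 'berry_color']
```

Because λ is asymmetric, `lobes_coarse` is removed (it is fully determined by `leaf_lobes`) while `leaf_lobes` is retained (the coarse trait does not determine it). Dropping only fully determined traits is what makes such a reduction free of information loss under this assumption.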

Publisher

MDPI AG

Subject

Plant Science; Ecology; Ecology, Evolution, Behavior and Systematics

