Unsupervised Discretization of Continuous Variables in a Chicken Egg Quality Traits Dataset-Reference-Cited by-同舟云学术

Unsupervised Discretization of Continuous Variables in a Chicken Egg Quality Traits Dataset

Published:2017-04-05 Issue:4 Volume:5 Page:315
ISSN:2148-127X
Container-title:Turkish Journal of Agriculture - Food Science and Technology
language:
Short-container-title:Turkish JAF Sci.Tech.

Author:

Cebeci Zeynel,Yıldız Figen

Abstract

Discretization is a data pre-processing task transforming continuous variables into discrete ones in order to apply some data mining algorithms such as association rules extraction and classification trees. In this study we empirically compared the performances of equal width intervals (EWI), equal frequency intervals (EFI) and K-means clustering (KMC) methods to discretize 14 continuous variables in a chicken egg quality traits dataset. We revealed that these unsupervised discretization methods can decrease the training error rates and increase the test accuracies of the classification tree models. By comparing the training errors and test accuracies of the model applied with C5.0 classification tree algorithm we also found that EWI, EFI and KMC methods produced the more or less similar results. Among the rules used for estimating the number of intervals, the Rice rule gave the best result with EWI but not with EFI. It was also found that Freedman-Diaconis rule with EFI and Doane rule with EFI and EWI slightly performed better than the other rules.

Publisher

Turkish Science and Technology Publishing (TURSTEP)

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. HEOD: Human-assisted Ensemble Outlier Detection for cybersecurity;Computers & Security;2024-11

2. HSMM multi-observations for prognostics and health management;Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability;2024-03-24

3. Breast cancer detection by using associative classifier with rule refinement method based on relevance feedback;Neural Computing and Applications;2022-06-23

4. Discretization of a Continuous Frequency Value in a Model of Socially Significant Behavior;2022 XXV International Conference on Soft Computing and Measurements (SCM);2022-05-25

5. EF_Unique: An Improved Version of Unsupervised Equal Frequency Discretization Method;Arabian Journal for Science and Engineering;2018-03-03