A Comparative Analysis of Classification Algorithms on Diverse Datasets-Reference-Cited by-同舟云学术

A Comparative Analysis of Classification Algorithms on Diverse Datasets

Published:2018-04-19 Issue:2 Volume:8 Page:2790-2795
ISSN:1792-8036
Container-title:Engineering, Technology & Applied Science Research
language:
Short-container-title:Eng. Technol. Appl. Sci. Res.

Author:

Alghobiri M.

Abstract

Data mining involves the computational process to find patterns from large data sets. Classification, one of the main domains of data mining, involves known structure generalizing to apply to a new dataset and predict its class. There are various classification algorithms being used to classify various data sets. They are based on different methods such as probability, decision tree, neural network, nearest neighbor, boolean and fuzzy logic, kernel-based etc. In this paper, we apply three diverse classification algorithms on ten datasets. The datasets have been selected based on their size and/or number and nature of attributes. Results have been discussed using some performance evaluation measures like precision, accuracy, F-measure, Kappa statistics, mean absolute error, relative absolute error, ROC Area etc. Comparative analysis has been carried out using the performance evaluation measures of accuracy, precision, and F-measure. We specify features and limitations of the classification algorithms for the diverse nature datasets.

Publisher

Engineering, Technology & Applied Science Research

Link

https://etasr.com/index.php/ETASR/article/download/1952/pdf

Reference32 articles.

1. N. M. Ramos, J. M. Delgado, R. M. Almeida, M. L. Simoes, S. Manuel, Appliation of Data Mining Techniques in the Analysis of Indoor Hygrothermal Conditions, Springer, 2015

2. B. Bakhshinategh, O. R. Zaiane, S. ElAtia, D. Ipperciel, “Educational data mining applications and tasks: A survey of the last 10 years”, Education and Information Technologies, Vol. 23, No. 1, pp. 537-553, 2018

3. F. Ahmed, M. Samorani, C. Bellinger, O. R. Zaiane, “Advantage of integration in big data: Feature generation in multi-relational databases for imbalanced learning”, IEEE International Conference on Big Data, Washington, DC, USA, pp. 532-539, December 5-8, 2016

4. P. G. Clark, C. Gao, J. W. Grzymala-Busse, “MLEM2 Rule Induction Algorithm with Multiple Scanning Discretization”, Smart Innovation, Systems and Technologies, Vol. 72, pp. 218-227, Springer, 2017

5. H. U. Khan, A. Daud, U. Ishfaq, T. Amjad, N. Aljohani, R. A. Abbasi, J. S. Alowibdi, “Modelling to identify influential bloggers in the blogosphere: a survey”, Computers in Human Behavior, Vol. 68, pp. 64-82, 2017

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Navigating techniques in job recommender systems on internship profile matching: a systematic review;Journal of Research in Innovative Teaching & Learning;2024-08-01

2. Improving the Effectiveness of E-learning Videos by leveraging Eye-gaze Data;Engineering, Technology & Applied Science Research;2023-12-05

3. Selection of Classification and Regression Algorithms for Knowledge Discovery –A Review;2022 Seventh International Conference on Parallel, Distributed and Grid Computing (PDGC);2022-11-25

4. Identifying Cyberspace Users’ Tendency in Blog Writing Using Machine Learning Algorithms;Engineering Mathematics and Computing;2022-10-04

5. Data Mining Regarding Cyberbullying in the Arabic Language on Instagram Using KNIME and Orange Tools;Engineering, Technology & Applied Science Research;2022-10-02