Authors:
Doulah Md. Siraj-Ud-, Islam Md. Nazmul
Abstract
Machine learning is one of the fastest-growing areas of computer science, with far-reaching applications. Of its several applications, the most significant is supervised learning, which is common in classification problems. In this study, twelve frequently used machine learning algorithms are considered: NB, LDA, LR, ANN, SVM, K-NN, HT, DT, C4.5, CART, RF and BB. We apply these algorithms to seven datasets. The main goal of this study is to evaluate the performance of these algorithms on both binary and multiclass classification problems using a variety of performance metrics: accuracy, kappa statistic, precision, recall, specificity, F-measure, MAE, RMSE and MCC. We found that the RF algorithm had the best performance on three of the seven datasets, while four other algorithms (NN, NB, BB and LR) also performed well.
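As an illustration of the count-based metrics named in the abstract, the following is a minimal sketch of how they can be derived from a binary confusion matrix. The function name `binary_metrics`, the helper `_safe_div`, and the example counts are hypothetical and are not taken from the study. MAE and RMSE are omitted here, since they are usually computed from per-instance predictions rather than from the four counts alone.

```python
import math


def _safe_div(num, den):
    # Hypothetical helper: return 0.0 instead of raising when a denominator is zero.
    return num / den if den else 0.0


def binary_metrics(tp, fp, fn, tn):
    """Compute common classification metrics from binary confusion-matrix counts."""
    n = tp + fp + fn + tn

    accuracy = _safe_div(tp + tn, n)
    precision = _safe_div(tp, tp + fp)
    recall = _safe_div(tp, tp + fn)          # also called sensitivity
    specificity = _safe_div(tn, tn + fp)
    f_measure = _safe_div(2 * precision * recall, precision + recall)

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_observed = accuracy
    p_expected = _safe_div(
        (tp + fp) * (tp + fn) + (fn + tn) * (fp + tn), n * n
    )
    kappa = _safe_div(p_observed - p_expected, 1 - p_expected)

    # Matthews correlation coefficient (MCC).
    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = _safe_div(tp * tn - fp * fn, mcc_den)

    return {
        "accuracy": accuracy,
        "kappa": kappa,
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f_measure": f_measure,
        "mcc": mcc,
    }


if __name__ == "__main__":
    # Made-up counts for illustration only; not results from the paper.
    for name, value in binary_metrics(tp=40, fp=10, fn=5, tn=45).items():
        print(f"{name}: {value:.4f}")
```

For a multiclass problem, one common approach is to compute these quantities per class in a one-vs-rest fashion and then average them; whether the study did so is not stated in the abstract.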