Affiliation:
1. Zonguldak Bülent Ecevit Üniversitesi
Abstract
The liver, a life-sustaining organ, plays a substantial role in many body functions. Liver diseases have become an important world health problem in terms of prevalence, incidences, and mortalities. Liver fibrosis/cirrhosis is great of importance, because if not treated in time liver cancer could be occurred and spread to other parts of the body. For this reason, early diagnosis of liver fibrosis/cirrhosis gives significance. Accordingly, this study investigated the performances of different machine learning algorithms for prediction of liver fibrosis/cirrhosis based on demographic and blood values. In this context, random forest, k nearest neighbour, C4.5 decision tree, K-star, random tree and reduced error pruning tree algorithms were used. Two distinct approaches were employed to evaluate the performances of machine learning algorithms. In the first approach, the entire features of dataset were utilized, while in the second approach, only the features selected through principal component analysis were used. Each approach was rigorously assessed using both 10-fold cross-validation and data splitting (70% train and 30% test) techniques. By conducting separate evaluations for each approach, a comprehensive understanding of the effectiveness of utilizing all features versus extracted features based principal component analysis was attained, providing valuable insights into the impact of feature dimensionality reduction on model performance. In this study, all analyses were implemented on WEKA data mining tool. In the first approach, the classification accuracies of random forest algorithm were 89.72% and 90.75% with the application of data splitting (70%-30%) and cross-validation techniques, respectively. In the second approach, where feature reduction is performed using principal component analysis technique, the accuracy values obtained from data splitting and cross-validation techniques of random forest algorithm were 88.61% and 88.83%, respectively. The obtained results revealed out that random forest algorithm outperformed for both approaches. Besides, the application of principal component analysis technique negatively affected the classification performance of used machine learning algorithms. It is thought that the proposed model will guide specialist physicians in making appropriate treatment decisions for patients with liver fibrosis/cirrhosis, potentially leading to death in its advanced stages.
Publisher
Black Sea Journal of Engineering and Science
Reference35 articles.
1. Acarlı K. 2020. Karaciğer sağlığını koruyan 10 hayati öneri. URL: https://www.memorial.com.tr/saglik-rehberi/karaciger-sagligini-koruyan-10-hayati-oneri (accessed date: August 28. 2023).
2. Alaybeyoğlu A, Mulayim N. 2018. Karaciğer kanseri teşhisinde destek vektör makinesi tabanli uzman sistem tasarimi. Tıp Teknolojileri Kongresi, 8-10 Kasım, Gazi Magaso, KKTC, ss: 208-210.
3. Alkuşak E, Gök M. 2014. Karaciğer yetmezliğinin teşhisinde makine öğrenmesi algoritmalarinin kullanimi. ISITES 2014, June 8-10, Karabük, Türkiye, pp: 703-707.
4. Asrani SK, Devarbhavi H, Eaton J, Kamath PS. 2019. Burden of liver diseases in the world. J Hepatol, 70(1): 151-171.
5. Azam MS, Rahman A, Iqbal SHS, Ahmed MT. 2020. Prediction of liver diseases by using few machine learning based approaches. Aust J Eng Innov Technol, 2(5): 85-90.