Abstract
Predicting and classifying diseases is essential in medical science because it helps limit the spread of disease and identify infected regions at an early stage. Machine learning (ML) approaches are commonly used for disease prediction and classification and serve as an efficient tool for doctors and specialists. This paper proposes an ML-based prediction framework for Hepatitis C Virus among healthcare workers in Egypt. We used real-world data from the National Liver Institute at Menoufiya University (Menoufiya, Egypt). The collected dataset consists of 859 patients with 12 different features. To assess the robustness and reliability of the proposed framework, we evaluated two scenarios: the first without feature selection and the second with features selected by sequential forward selection (SFS). We also evaluated the feature subset chosen from the features generated by SFS. Naïve Bayes, random forest (RF), K-nearest neighbor, and logistic regression serve as both induction algorithms and classifiers for model evaluation. We then measured the effect of parameter tuning on the learning techniques. The experimental results indicate that the proposed framework achieves higher accuracy with SFS than without feature selection. Moreover, the RF classifier achieved 94.06% accuracy with a minimum learning elapsed time of 0.54 s. Finally, after tuning the hyperparameters of the RF classifier, classification accuracy improved to 94.88% using only four features.
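The greedy wrapper idea behind sequential forward selection can be illustrated with a minimal sketch. This is not the authors' implementation: the synthetic data, the 3-nearest-neighbor induction algorithm, the single hold-out split, and every function name below are assumptions chosen so the example runs with the Python standard library alone. The paper's dataset and its tuned RF classifier are not reproduced here.

```python
import random


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k nearest training rows (squared Euclidean distance)."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = [label for _, label in dists[:k]]
    return max(set(votes), key=votes.count)


def holdout_accuracy(X, y, features, split):
    """Accuracy on rows [split:] after training on rows [:split], using only `features`."""
    def project(row):
        return [row[j] for j in features]

    train_X = [project(r) for r in X[:split]]
    test_X = [project(r) for r in X[split:]]
    correct = sum(
        knn_predict(train_X, y[:split], x) == target
        for x, target in zip(test_X, y[split:])
    )
    return correct / len(test_X)


def sequential_forward_selection(X, y, n_select, split):
    """Greedily add the feature that most improves hold-out accuracy."""
    selected = []
    remaining = list(range(len(X[0])))
    while len(selected) < n_select:
        best = max(
            remaining,
            key=lambda j: holdout_accuracy(X, y, selected + [j], split),
        )
        selected.append(best)
        remaining.remove(best)
    return selected


def make_synthetic(n=200, seed=0):
    """Hypothetical data: features 0 and 1 carry the label, features 2-5 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = rng.randint(0, 1)
        informative = [label * 4.0 + rng.gauss(0, 1) for _ in range(2)]
        noise = [rng.gauss(0, 1) for _ in range(4)]
        X.append(informative + noise)
        y.append(label)
    return X, y


X, y = make_synthetic()
SPLIT = 150  # first 150 rows for training, the rest for evaluation (assumed split)
selected = sequential_forward_selection(X, y, n_select=2, split=SPLIT)
accuracy = holdout_accuracy(X, y, selected, SPLIT)
print("selected features:", selected)
print("hold-out accuracy:", accuracy)
```

In this sketch the classifier doubles as the induction algorithm that scores each candidate subset, mirroring the wrapper setup the abstract describes. A real study would replace the single hold-out split with cross-validation.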
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence, Hardware and Architecture, Human-Computer Interaction, Information Systems, Software