A Contemporary Machine Learning Method for Accurate Prediction of Cervical Cancer

Published: 2021 Volume: 102 Page: 04004
ISSN: 2261-2424
Container-title: SHS Web of Conferences
Short-container-title: SHS Web Conf.

Author:

Jeremiah Tanimu Jesse, Hamada Mohamed, Hassan Mohammed, Yusuf Ilu Saratu

Abstract

With the advent of new technologies in the medical field, large amounts of cancer data have been collected and are readily accessible to the medical research community. Over the years, researchers have employed advanced data mining and machine learning techniques to develop better models that analyze datasets and extract patterns and hidden knowledge. The mined information can support decision making in diagnostic processes. These techniques can predict the future outcomes of certain diseases effectively and can also discover patterns and relationships in complex datasets. In this research, a model has been developed to predict patients’ cervical cancer results from risk patterns in individual medical records and preliminary screening tests. This work presents a Decision Tree (DT) classification algorithm and shows the advantage of feature selection in cervical cancer prediction, using the recursive feature elimination technique to reduce dimensionality and improve the accuracy, sensitivity, and specificity of the model. The dataset employed here suffers from missing values and is highly imbalanced. Therefore, SMOTETomek, a combination of under- and oversampling techniques, was employed. A comparative analysis of the proposed model has been performed to show the effect of feature selection and class-imbalance handling on the classifier’s accuracy, sensitivity, and specificity. The DT with the selected features and SMOTETomek achieves better results, with an accuracy of 98%, sensitivity of 100%, and specificity of 97%. The Decision Tree classifier is shown to have excellent performance on classification tasks when the features are reduced and the class imbalance problem is addressed.
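The abstract evaluates the classifier by accuracy, sensitivity, and specificity on a highly imbalanced dataset. The following is a minimal sketch of how these three metrics are conventionally computed from confusion-matrix counts. The class name `ConfusionCounts`, the function names, and all counts are hypothetical illustrations and are not taken from the paper; the sketch only shows why accuracy alone can mislead when one class dominates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts for a binary classifier (positive = cancer present)."""
    tp: int  # true positives
    fn: int  # false negatives
    tn: int  # true negatives
    fp: int  # false positives


def accuracy(c: ConfusionCounts) -> float:
    """Fraction of all cases classified correctly."""
    return (c.tp + c.tn) / (c.tp + c.fn + c.tn + c.fp)


def sensitivity(c: ConfusionCounts) -> float:
    """Fraction of actual positives that were detected (recall)."""
    positives = c.tp + c.fn
    return c.tp / positives if positives else 0.0


def specificity(c: ConfusionCounts) -> float:
    """Fraction of actual negatives that were correctly rejected."""
    negatives = c.tn + c.fp
    return c.tn / negatives if negatives else 0.0


if __name__ == "__main__":
    # Hypothetical test set: 20 positive and 180 negative cases.
    # A model that always predicts "negative" looks accurate
    # but detects no positive case at all.
    majority_only = ConfusionCounts(tp=0, fn=20, tn=180, fp=0)
    # A hypothetical model that actually separates the classes.
    screening_model = ConfusionCounts(tp=18, fn=2, tn=170, fp=10)

    for name, counts in [("majority_only", majority_only),
                         ("screening_model", screening_model)]:
        print(name,
              f"accuracy={accuracy(counts):.3f}",
              f"sensitivity={sensitivity(counts):.3f}",
              f"specificity={specificity(counts):.3f}")
```

In this made-up example the always-negative model reaches 90% accuracy with zero sensitivity, which is the kind of failure that reporting sensitivity and specificity alongside accuracy is meant to expose.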

Publisher

EDP Sciences

Link

https://www.shs-conferences.org/10.1051/shsconf/202110204004/pdf

References: 29 articles.

1. Big Data Recommendations for Industrial–Organizational Psychology

2. Data mining

3. Hassan M., Ph.D. thesis, The University of Aizu (2018)

4. Cluster analysis for diabetic retinopathy prediction using data mining techniques

5. Genetic Algorithm Approaches for Improving Prediction Accuracy of Multi-criteria Recommender Systems

Cited by 12 articles.

1. Comprehensive analysis of artificial intelligence techniques for gynaecological cancer: symptoms identification, prognosis and prediction; Artificial Intelligence Review; 2024-07-29

2. Prediction of Cervical Cancer With Application of Machine Learning Models; Advances in Healthcare Information Systems and Administration; 2024-06-30

3. Performance Comparison of XGBoost and LightGBM Gradient Boosting Algorithms in Predicting Cervical Cancer Risk; 2024 International Conference on Computing and Data Science (ICCDS); 2024-04-26

4. Investigation on explainable machine learning models to predict chronic kidney diseases; Scientific Reports; 2024-02-14

5. Cervical Cancer Prediction Using Machine Learning Techniques; Lecture Notes in Networks and Systems; 2024