Abstract
Abstract
Cervical cancer becomes a major cause of cancer deaths in women around the world. The objective of this study is to provide a comprehensive analysis of different data mining methods to diagnose the malignant cancer samples. Different data mining algorithms (SVM, Naïve Bayes, and KNN) has been applied on four different medical tests (Biopsy, Cytology, Hinselmann, and Schiller) as four different target variables. The attributes influence the disease most is extracted since the disease has no symptoms in the early stage. The extraction involved over 32 attributes and two different algorithms such as Correlation-based Filter (CFS) and Random Forest. The results showed that the performance of Naïve Bayes classifier outperforms other classifiers after evaluation using 10-fold cross-validation method in R environment. In addition, the use of attribute selection has been proved not only can select the highly important attributes but also to increase the performance of all classifiers on cervical cancer dataset. In this study, the work reveals the classifiers can effectively achieve the best performance with the least number of highly important attributes.
Subject
General Physics and Astronomy
Reference14 articles.
1. Cancer burden in the year 2000. The global picture;Parkin;Eur J Cancer,2001
2. The relationship between Age and Histological Types of Cervical Cancer;Irabor;Int. J. Of Scientific Research,2018
3. Data-Driven Diagnosis of Cervical Cancer with Support Vector Machine-Base Approaches;Wu,2017
4. Cervical cancer prediction using data mining;Punjani;Int. J. for Research in Applied Science & Eng. Tech. (IJRASET),2017
5. A survey on Data Mining Approaches for Healthcare;Tomar;Int. J. of Bio-Science and Bio-Tech.,2013
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献