Enhanced machine learning based feature subset through FFS enabled classification for cervical cancer diagnosis

Author:

B Nithya (1), V Ilango (2)

Affiliation:

1. New Horizon College of Engineering, Bengaluru, India

2. Department of MCA, CMR Institute of Technology, Bengaluru, India

Abstract

Datasets with a large number of features and imbalanced classes make it difficult for Machine Learning (ML) classifiers to reach adequate accuracy. This research aims to find the optimal feature subset and an efficient classification approach for cervical cancer diagnosis by evaluating the performance of several ML predictive models. Two filter-based feature selection techniques, Relief and Information Gain, are applied to score and rank each feature so that the highest-scoring features can be selected. An optimal feature subset is then generated with wrapper approaches: Recursive Feature Elimination, which uses a Random Forest procedure, and a Genetic Algorithm, which is based on evolutionary principles. The predictive models are built with 10-fold cross-validation using widely used classification algorithms: Random Forest, C5.0, K-Nearest Neighbour and Naïve Bayes. The results show that the average performance of these classifiers improves and that their classification error decreases substantially. The experiments also show that an optimal, reduced feature subset improves classification accuracy at a lower computational cost. The features generated by the fused approach of Relief and the Genetic Algorithm predicted the results efficiently, so this procedure was used to select the optimal feature subset. Most of the classifiers showed good performance. In addition, Random Forest showed a higher accuracy rate, with improved sensitivity and specificity. This work also established that optimal feature subset selection through the Fused Feature Selection (FFS) approach can reduce the complexity of the predictive model.
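The filter-based ranking step described in the abstract can be illustrated with a minimal sketch of Information Gain scoring. This is not the authors' implementation: the function names (`entropy`, `information_gain`, `rank_features`), the toy data, and the assumption that all features are discrete are hypothetical and chosen only for illustration. Relief, Recursive Feature Elimination and the Genetic Algorithm are not shown.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels):
    """Reduction in label entropy after splitting on one discrete feature."""
    groups = defaultdict(list)
    for value, label in zip(feature_values, labels):
        groups[value].append(label)
    total = len(labels)
    # Weighted average entropy of the labels within each feature value.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(rows, labels, feature_names):
    """Return (name, score) pairs sorted from highest to lowest gain."""
    scores = {}
    for j, name in enumerate(feature_names):
        column = [row[j] for row in rows]
        scores[name] = information_gain(column, labels)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data invented for illustration; not taken from the cervical cancer dataset.
feature_names = ["f_informative", "f_noise"]
rows = [(1, 0), (1, 1), (0, 0), (0, 1)]
labels = [1, 1, 0, 0]

ranking = rank_features(rows, labels, feature_names)
top_k = [name for name, _ in ranking[:1]]  # keep the single best-ranked feature
```

In a fused pipeline of the kind the abstract outlines, a ranking like this would only narrow the candidate features; a wrapper search would then evaluate subsets with an actual classifier under cross-validation.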

Publisher

IOS Press

Subject

Artificial Intelligence, Control and Systems Engineering, Software

