Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine Learning

Author:

Shafique Rahman1ORCID,Rustam Furqan2ORCID,Choi Gyu Sang1ORCID,Díez Isabel de la Torre3ORCID,Mahmood Arif4,Lipari Vivian567,Velasco Carmen Lili Rodríguez589,Ashraf Imran1ORCID

Affiliation:

1. Department of Information and Communication Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea

2. School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland

3. Department of Signal Theory and Communications and Telematic Engineering, University of Valladolid, Paseo de Belén 15, 47011 Valladolid, Spain

4. Department of Computer Science & Information Technology, The Islamia University of Bahawalpur, Bahawalpur 63100, Punjab, Pakistan

5. Research Group on Foods, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, Isabel Torres 21, 39011 Santander, Spain

6. Department of Project Management, Universidad Internacional Iberoamericana, Campeche 24560, Mexico

7. Fundación Universitaria Internacional de Colombia Bogotá, Bogotá 11001, Colombia

8. Department of Project Management, Universidad Internacional Iberoamericana Arecibo, Arecibo, PR 00613, USA

9. Project Management, Universidade Internacional do Cuanza, Cuito EN250, Bié, Angola

Abstract

Breast cancer is one of the most common invasive cancers in women and it continues to be a worldwide medical problem since the number of cases has significantly increased over the past decade. Breast cancer is the second leading cause of death from cancer in women. The early detection of breast cancer can save human life but the traditional approach for detecting breast cancer disease needs various laboratory tests involving medical experts. To reduce human error and speed up breast cancer detection, an automatic system is required that would perform the diagnosis accurately and timely. Despite the research efforts for automated systems for cancer detection, a wide gap exists between the desired and provided accuracy of current approaches. To overcome this issue, this research proposes an approach for breast cancer prediction by selecting the best fine needle aspiration features. To enhance the prediction accuracy, several feature selection techniques are applied to analyze their efficacy, such as principal component analysis, singular vector decomposition, and chi-square (Chi2). Extensive experiments are performed with different features and different set sizes of features to investigate the optimal feature set. Additionally, the influence of imbalanced and balanced data using the SMOTE approach is investigated. Six classifiers including random forest, support vector machine, gradient boosting machine, logistic regression, multilayer perceptron, and K-nearest neighbors (KNN) are tuned to achieve increased classification accuracy. Results indicate that KNN outperforms all other classifiers on the used dataset with 20 features using SVD and with the 15 most important features using a PCA with a 100% accuracy score.

Funder

European University of the Atlantic

Publisher

MDPI AG

Subject

Cancer Research,Oncology

Reference49 articles.

1. Underreported Breast and Cervical Cancer Deaths Among Brought-In-Dead Cases in Zambia;Haruyama;JCO Glob. Oncol.,2021

2. Breast cancer statistics, 2022;Giaquinto;CA Cancer J. Clin.,2022

3. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries;Bray;CA Cancer J. Clin.,2018

4. Primary care physicians and cancer care in Pakistan: A short narrative;Ali;J. Cancer Policy,2020

5. (2021, March 26). WHO, Available online: https://www.who.int/news-room/fact-sheets/detail/breast-cancer.

Cited by 18 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3