An integrative machine learning framework for classifying SEER breast cancer

Published:2023-04-01 Issue:1 Volume:13
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Manikandan P., Durga U., Ponnuraja C.

Abstract

Breast cancer is the most common type of cancer in women worldwide and the leading cause of mortality for females. The aim of this research is to classify the survival status (alive or dead) of breast cancer patients using the Surveillance, Epidemiology, and End Results (SEER) dataset. Because they can handle enormous data sets systematically, machine learning and deep learning have been widely employed in biomedical research to address diverse classification problems. Pre-processing the data enables its visualization and analysis for use in making important decisions. This research presents a feasible machine learning-based approach for categorizing the SEER breast cancer dataset. A two-step feature selection method based on Variance Threshold and Principal Component Analysis was employed to select features from the dataset. After feature selection, the dataset is classified using supervised and ensemble learning techniques: AdaBoost, XGBoost, Gradient Boosting, Naive Bayes, and Decision Tree. The performance of these algorithms is evaluated using both a train-test split and k-fold cross-validation. The Decision Tree achieved 98% accuracy under both the train-test split and cross-validation. In this study, the Decision Tree algorithm outperforms the other supervised and ensemble learning approaches on the SEER breast cancer dataset.
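
The workflow described in the abstract (variance filtering, then PCA, then a classifier evaluated with both a hold-out split and k-fold cross-validation) can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' code. The synthetic data stands in for the SEER table, and the variance threshold, number of principal components, test fraction, and fold count are all assumptions, since the abstract does not state them.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.feature_selection import VarianceThreshold
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier

# Synthetic binary-label data as a stand-in for the SEER records (assumption).
X, y = make_classification(
    n_samples=500, n_features=20, n_informative=8, random_state=0
)


def build_pipeline():
    """Two-step feature reduction followed by a Decision Tree.

    All hyperparameters here are illustrative assumptions.
    """
    return make_pipeline(
        VarianceThreshold(threshold=0.0),  # step 1: drop zero-variance features
        PCA(n_components=5),               # step 2: project onto 5 components
        DecisionTreeClassifier(random_state=0),
    )


# Evaluation 1: a single stratified train-test split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)
holdout_accuracy = build_pipeline().fit(X_train, y_train).score(X_test, y_test)

# Evaluation 2: 5-fold cross-validation; the pipeline is refit inside each
# fold, so the feature-reduction steps never see the held-out fold.
cv_scores = cross_val_score(build_pipeline(), X, y, cv=5, scoring="accuracy")

print(f"hold-out accuracy: {holdout_accuracy:.3f}")
print(f"mean 5-fold accuracy: {cv_scores.mean():.3f}")
```

Wrapping the feature-reduction steps and the classifier in one pipeline is a design choice of this sketch: it keeps the variance filter and PCA from being fitted on test data. Accuracy on synthetic data says nothing about the 98% figure reported for SEER.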

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-023-32029-1.pdf

Reference 38 articles.

1. https://www.who.int/news-room/fact-sheets/detail/breast-cancer.

2. Bi, W. L. et al. Artificial intelligence in cancer imaging: Clinical challenges and applications. CA Cancer J. Clin. 69, 127–157 (2019).

3. Ibrahim, S., Nazir, S. & Velastin, S. A. Feature selection using correlation analysis and principal component analysis for accurate breast cancer diagnosis. J. Imaging. 7(11), 225. https://doi.org/10.3390/jimaging7110225 (2021).

4. Haq, A. et al. Detection of breast cancer through clinical data using supervised and unsupervised feature selection techniques. IEEE Access. 1, 1–1. https://doi.org/10.1109/ACCESS.2021.3055806 (2021).

5. Liu, S. et al. Survival time prediction of breast cancer patients using feature selection algorithm crystall. IEEE Access 9, 24433–24445. https://doi.org/10.1109/ACCESS.2021.3054823 (2021).

Cited by 15 articles.

1. Harnessing Fusion Modeling for Enhanced Breast Cancer Classification through Interpretable Artificial Intelligence and In-Depth Explanations;Engineering Applications of Artificial Intelligence;2024-10

2. Computational prediction of phosphorylation sites of SARS-CoV-2 infection using feature fusion and optimization strategies;Methods;2024-09

3. An ensemble classification approach for cervical cancer prediction using behavioral risk factors;Healthcare Analytics;2024-06

4. A Comparative Study of Random Forest and Gradient Boosting Machine Learning Algorithms for Tumor Classification in Biomedical Images;2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE);2024-05-09

5. ML: Early Breast Cancer Diagnosis;Current Problems in Cancer: Case Reports;2024-03