Abstract
This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer.
Funder
Ministry of Higher Education
Reference89 articles.
1. International variation in female breast cancer incidence and mortality rates;Cancer Epidemiol. Biomark. Prev.,2015
2. (2022, May 24). WHO Breast Cancer. Available online: https://www.who.int/news-room/fact-sheets/detail/breast-cancer.
3. Parks, R.M., Derks, M.G.M., Bastiaannet, E., and Cheung, K.L. (2018). Breast Cancer Management for Surgeons, Springer.
4. Breast Cancer Before Age 40 Years;Semin. Oncol.,2009
5. Epidemiological characteristics of and risk factors for breast cancer in the world;Breast Cancer Targets Ther.,2019
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Hybrid Methods For Classification Of Breast Cancer Using Machine Learning Techniques;2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN);2023-05-05