CLASSIFICATION MODEL FOR BREAST CANCER MAMMOGRAMS

Author:

Mohamad Samuri Suzani, Nova Try Viananda, Rahmatullah Bahbibi, Wang Shir Li, Al-Qaysi Z.T

Abstract

Machine learning has been a topic of interest in research on the early detection of breast cancer from mammogram images. In this study, we compare the performance of three (3) machine learning techniques, 1) Naïve Bayes (NB), 2) Neural Network (NN) and 3) Support Vector Machine (SVM), on 2000 digital mammogram images to choose the technique that best models the relationship between the extracted features and the state of the breast ('Normal' or 'Cancer'). The Grey Level Co-occurrence Matrix (GLCM), which represents the two-dimensional variation of grey levels in the image, is used in the feature extraction process. Six (6) attributes, consisting of contrast, variance, standard deviation, kurtosis, mean and smoothness, were computed as the extracted features and used as inputs to the classification process. The data were randomized and the experiment was repeated ten (10) times to check the consistency of the performance of all techniques. In each experiment, 70% of the data were used for training and the remaining 30% for testing. The results after ten (10) experiments show that the Support Vector Machine (SVM) gives the most consistent results in correctly classifying the state of the breast as 'Normal' or 'Cancer', with an accuracy of 99.4% in training and 98.76% in testing. The SVM classification model outperformed the NN and NB models in this study, which indicates that SVM is a good choice for determining the state of the breast at an early stage.
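The feature extraction and evaluation protocol described in the abstract can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' implementation: the pixel offset (one step to the right), the four grey levels, the toy image, the toy one-dimensional data and the function names are all assumptions, and the nearest-mean classifier is only a placeholder standing in for NB, NN or SVM.

```python
import random
from statistics import mean, pstdev


def glcm(image, levels, dx=1, dy=0):
    """Normalized grey-level co-occurrence matrix for one pixel offset.

    The offset (dx, dy) is an assumption; the abstract does not state it.
    """
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
    total = sum(map(sum, counts))
    return [[n / total for n in row] for row in counts]


def glcm_contrast(p):
    """Contrast: sum over (i, j) of p[i][j] * (i - j)^2."""
    size = len(p)
    return sum(p[i][j] * (i - j) ** 2 for i in range(size) for j in range(size))


def repeated_holdout(samples, labels, fit_predict, runs=10, train_frac=0.7, seed=0):
    """Shuffle, split 70/30, score on the test part, and repeat `runs` times."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(runs):
        order = list(range(len(samples)))
        rng.shuffle(order)
        cut = round(train_frac * len(order))
        train, test = order[:cut], order[cut:]
        predicted = fit_predict(
            [samples[i] for i in train],
            [labels[i] for i in train],
            [samples[i] for i in test],
        )
        correct = sum(p == labels[i] for p, i in zip(predicted, test))
        accuracies.append(correct / len(test))
    return mean(accuracies), pstdev(accuracies)


def nearest_mean_classifier(train_x, train_y, test_x):
    """Placeholder classifier (not NB, NN or SVM): pick the class with the closest mean."""
    centres = {
        c: mean(x for x, y in zip(train_x, train_y) if y == c) for c in set(train_y)
    }
    return [min(centres, key=lambda c: abs(x - centres[c])) for x in test_x]


# Toy 4-level image (hypothetical, not a mammogram).
toy_image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]
contrast = glcm_contrast(glcm(toy_image, levels=4))

# Toy, well-separated one-dimensional feature values (hypothetical data).
toy_x = [i / 500 for i in range(50)] + [0.9 + i / 500 for i in range(50)]
toy_y = ["Normal"] * 50 + ["Cancer"] * 50
mean_acc, spread = repeated_holdout(toy_x, toy_y, nearest_mean_classifier)
```

Reporting the spread of accuracy across the ten runs alongside the mean is one way to express the consistency the abstract refers to. In practice, each sample would be a six-element feature vector rather than a single number, and the placeholder classifier would be swapped for the model under comparison.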

Publisher

IIUM Press

Subject

Applied Mathematics, General Engineering, General Chemical Engineering, General Computer Science

Cited by 4 articles.
