Affiliation:
1. College of Computer Science, King Khalid University, Abha 62529, Saudi Arabia
2. Jeddah University, Jeddah, Saudi Arabia
3. Information Technology College, Ajloun National University, Jordan
Abstract
Breast cancer affects one in every eight women and is the most common cancer. Aim. To diagnose breast cancer, a potentially fatal condition, using microarray technology, large datasets can now be used. Methods. This study used machine learning algorithms and IOT to classify microarray data. They were created from two sets of data: one with 1919 protein types and one with 24481 protein types for 97 people, 46 of whom had a recurring disease and 51 of whom did not. The apps were written in Python. Each classification algorithm was applied to the data separately, without any feature elimination or size reduction. Second, two alternative feature reduction approaches were compared to the first case. In this case, machine learning techniques like Adaboost and Gradient Boosting Machine are used. Results. Before applying any feature reduction techniques, the logistic regression method produced the best results (90.23%), while the Random Forest method produced good results (67.22%). In the first data, SVM had the highest accuracy rate of 99.23% in both approaches, while in the second data, SVM had the highest rate of 87.87% in RLR and 88.82% in LTE. Deep learning was also done with MLP. The relationship between depth and classification accuracy was studied using it at various depths. After a while, the accuracy rate declined as the number of layers increased. The maximum accuracy rate in the first data was 97.69%, while it was 68.72% in the second. As a result, adding layers to deep learning does not improve classification accuracy.
Subject
General Mathematics,General Medicine,General Neuroscience,General Computer Science
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献