Affiliation:
1. School of Computer Science and Engineering Department, SMVDU, J&K, India
Abstract
Abstract
Mesothelioma is an aggressive lung cancer, harms the linings of the lungs. It is one of the deadliest cancers diagnosed in those exposed to fibrous silicate minerals (asbestos). Millions of people face severe consequences as they are diagnosed at late stages. This study presents a comparison of several machine learning approaches with distinct feature sets and addresses the issue of class imbalance. The dataset used in this study is available publicly on the University of California Irvine (UCI) machine learning repository. This study used the resampling technique, synthetic minority oversampling technique (SMOTE), and adaptive synthetic sampling (ADASYN) to handle the class imbalance. Most of the machine learning strategies performed well with the resampling technique. The best accuracy using the resampling strategy was achieved by artificial neural networks (ANN). The highest accuracy was recorded on the feature set selected by principal component analysis (PCA) is 96%. Overall, ensemble techniques performed well. The proposed stacking-based classifier achieved the highest accuracy (89%) on data balanced using SMOTE and ADASYN.
Publisher
Oxford University Press (OUP)
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献