Affiliation:
1. School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, India
2. Department of Electronics and Communication Engineering, Bannari Amman Institute of Technology, Sathyamangalam, India
Abstract
Renal cancer is among the top 10 cancers in both genders. Microarray gene expression data is one of the effective modalities to diagnose renal cancer. The main objective of this work is to label the gene expression sample as either normal or clear cell renal cell carcinoma. To improve the classification performance and reduce the training time of the above-mentioned supervised classifiers, various feature selection and dimensionality reduction techniques are investigated. Feature selection techniques, namely variance filter, chi-square test, ANOVA test, and mutual information filter, are tested. In addition, principal component analysis, independent component analysis, and linear discriminant analysis are evaluated as dimensionality reduction techniques. Highest balanced accuracy score of 91.6% is attained for support vector machine classifier while it was increased to 94.4% through the usage appropriate dimensionality reduction or feature selection technique.
Reference13 articles.
1. Artificial Intelligence in Cancer Research and Precision Medicine
2. Applications of Machine Learning in Cancer Prediction and Prognosis
3. CuMiDa: An Extensively Curated Microarray Database for Benchmarking and Testing of Machine Learning Approaches in Cancer Research
4. Howlader, N., Noone, A. M., Krapcho, M., Miller, D., Brest, A., Yu, M., Ruhl, J., Tatalovich, Z., Mariotto, A., Lewis, D. R., Chen, H. S., Feuer, E. J., & Cronin, K. A. (2023, July 10). SEER Cancer Statistics Review 1975-2016. National Cancer Institute. https://seer.cancer.gov/csr/1975_2016/
5. Renal cell carcinoma