Cancer Classification From DNA Microarray Using Genetic Algorithms and Case-Based Reasoning
-
Published:2023-12-29
Issue:
Volume:
Page:378-399
-
ISSN:
-
Container-title:Research Anthology on Bioinformatics, Genomics, and Computational Biology
-
language:ng
-
Short-container-title:
Author:
Machacha Lilybert1, Bhattacharya Prabir2
Affiliation:
1. Botho University, Gaborone, Botswana 2. Concordia University, Canada
Abstract
There are many similarities in the symptoms of several types of cancer and that makes it sometimes difficult for the physicians to do an accurate diagnosis. In addition, it is a technical challenge to classify accurately the cancer cells in order to differentiate one type of cancer from another. The DNA microarray technique (also called the DNA chip) has been used in the past for the classification of cancer but it generates a large volume of noisy data that has many features, and is difficult to analyze directly. This paper proposes a new method, combining the genetic algorithm, case-based reasoning, and the k-nearest neighbor classifier, which improves the performance of the classification considerably. The authors have also used the well-known Mahalanobis distance of multivariate statistics as a similarity measure that improves the accuracy. A case-based classifier approach together with the genetic algorithm has never been applied before for the classification of cancer, same with the application of the Mahalanobis distance. Thus, the proposed approach is a novel method for the cancer classification. Furthermore, the results from the proposed method show considerably better performance than other algorithms. Experiments were done on several benchmark datasets such as the leukemia dataset, the lymphoma dataset, ovarian cancer dataset, and breast cancer dataset.
Reference57 articles.
1. Instance-based learning algorithms 2. Alizadeh, A. A., Eisen, M. B., Davis, R. E., Ma, C., Lossos, I. S., Rosenwald, A., Boldrick, J. C., Sabet, H., Tran, T., Yu, X., Powell, J. I., Yang, L., Marti, G. E., Moore, T., Hudson, J., Lu, L., Lewis, D. B., Tibshirani, R., Sherlock, G., . . . Chan, W. C. (2000). Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature, 403, 503-511. 3. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays 4. Amaratunga, D., Cabrera, J., & Shkedy, Z. (2014). Exploration and analysis of DNA Microarray and Protein Array Data (2nd ed.). Wiley Sons. 5. Data Analysis and Visualization in Genomics and Proteomics
|
|