Affiliation:
1. National Superior Institute of Computer Science (ESI), Constantine, Algeria
2. Lire Laboratory, University of Constantine-2, Constantine, Algeria
Abstract
The clustering process is used to identify cancer subtypes based on gene expression and DNA methylation datasets, since cancer subtype information is critically important for understanding tumor heterogeneity, detecting previously unknown clusters of biological samples, which are usually associated with unknown types of cancer will, in turn, gives way to prescribe more effective treatments for patients. This is because cancer has varying subtypes which often respond disparately to the same treatment. While the DNA methylation database is extremely large-scale datasets, running time still remains a major challenge. Actually, traditional clustering algorithms are too slow to handle biological high-dimensional datasets, they usually require large amounts of computational time. The proposed clustering algorithm extraordinarily overcomes all others in terms of running time, it is able to rapidly identify a set of biologically relevant clusters in large-scale DNA methylation datasets, its superiority over the others has been demonstrated regarding its relative speed.
Subject
Decision Sciences (miscellaneous),Computational Mathematics,Computational Theory and Mathematics,Control and Optimization,Computer Science Applications,Modeling and Simulation,Statistics and Probability
Reference26 articles.
1. Aggarwal, C. (2014). An Introduction to Cluster Analysis. In Data Clustering: Algorithms and Applications (pp. 1-27).
2. Alelyani, S., Tang, J., & Liu, H. (2014). Feature Selection for Clustering: A Review. In Data Clustering: Algorithms and Applications (pp. 29-60). Chapman and Hall/CRC.
3. Anastasiu, D. C., Tagarelli, A., & Karypis, G. (2014). Document Clustering: The Next Frontier. In Data Clustering: Algorithms and Applications (pp. 305-337).
4. Andreopoulos, B. (2014). Clustering Categorical Data. In Data Clustering: Algorithms and Applications (pp. 277-303).
5. Effective Clustering Algorithms for Gene Expression Data;T.Chandrasekhar;International Journal of Computers and Applications,2011
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献