Affiliation:
1. Departamento de Ciencias Computacionales, Universidad de Guadalajara, Guadalajara, Mexico
Abstract
Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach for performing cluster analysis of DNA sequences that is based on the use of GSP methods and the K-means algorithm. We also propose a visualization method that facilitates the easy inspection and analysis of the results and possible hidden behaviors. Our results support the feasibility of employing the proposed method to find and easily visualize interesting features of sets of DNA data.
Subject
General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience
Reference51 articles.
1. Evolution of the primate cytochrome c oxidase subunit II gene;Adkins;Journal of Molecular Evolution,1994
2. On DNA numerical representations for period-3 based exon prediction;Akhtar,2007
3. Signal processing in sequence analysis: advances in eukaryotic gene prediction;Akhtar;Journal of Selected Topics in Signal Processing,2008
4. Frequency-domain analysis of biomolecular sequences;Anastassiou;Bioinformatics,2000
5. Numerical taxonomy and cluster analysis;Baikey,1994
Cited by
22 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献