Affiliation:
1. King Faisal University, Saudi Arabia
2. University Technology PETRONAS, Malaysia
Abstract
DNA is considered the building block of living species. DNA sequence alignment and analysis have been big challenges for the scientists for many years. This research presents a comparative analysis of state of the art software engineering approaches for sequence analysis, i.e. genome sequences in particular. Sequence analysis problems are NP hard and need optimal solutions. The underlying problems stated are duplicate sequence detection, sequence matching by relevance, and sequence analysis by approximate comparison in general and by using tools, i.e. Matlab and multi-lingual sequence analysis. The usefulness of these operations is also highlighted, and future expectations are described. The proposal describes the concepts, tools, methodologies, and algorithms being used for sequence analysis. The sequences contain the precious information that needs to be mined for useful purposes. There is high concentration required to model the optimal solution. The similarity and alignments concepts cannot be addressed directly with one technique or algorithm; a better performance is achieved by the comprehension of different concepts.
Reference23 articles.
1. Agrawal, A., & Huang, X. (2008). Pairwise DNA alignment with sequence specific transition-transversion ratio using multiple parameter sets. International Conference on Information Technology, (pp. 89–93).
2. Agrawal, A., & Khaitan, S. K. (2008). A new heuristic for multiple sequence alignment. IEEE International Conference on Electro/Information Technology, (pp. 215–217).
3. Ahmad, M., Khan, H. M., & Mathkour, H. (2009). An integrated statistical comparative analysis between variant genetic datasets of Mus musculus. International Journal of Computational Intelligence in Bioinformatics and System Biology, 1(2), 163-176. ISSN: 1755-8042
4. Ahmad, M., & Mathkour, H. (2009). A pattern matching approach for redundancy detection in bi-lingual and mono-lingual corpora. IAENG 2009.
5. Ahmad, M., & Mathkour, H. (2009). Genome sequence analysis: A survey. Journal of Computer Science, 5(9), 651-660. ISSN 1549-3636