Mapping Biomolecular Sequences: Graphical Representations - Their
Origins, Applications and Future Prospects
-
Published:2022-03
Issue:3
Volume:25
Page:354-364
-
ISSN:1386-2073
-
Container-title:Combinatorial Chemistry & High Throughput Screening
-
language:en
-
Short-container-title:CCHTS
Affiliation:
1. Theory Division, Centre for Interdisciplinary Research and Education, Kolkata 700068, India
Abstract
:
The exponential growth in the depositories of biological sequence data has generated an
urgent need to store, retrieve and analyse the data efficiently and effectively for which the standard
practice of using alignment procedures are not adequate due to high demand on computing
resources and time. Graphical representation of sequences has become one of the most popular
alignment-free strategies to analyse the biological sequences where each basic unit of the
sequences – the bases adenine, cytosine, guanine and thymine for DNA/RNA, and the 20 amino
acids for proteins – are plotted on a multi-dimensional grid. The resulting curve in 2D and 3D
space and the implied graph in higher dimensions provide a perception of the underlying
information of the sequences through visual inspection; numerical analyses, in geometrical or
matrix terms, of the plots provide a measure of comparison between sequences and thus enable
study of sequence hierarchies. The new approach has also enabled studies of comparisons of DNA
sequences over many thousands of bases and provided new insights into the structure of the base
compositions of DNA sequences. In this article we review in brief the origins and applications of
graphical representations and highlight the future perspectives in this field.
Publisher
Bentham Science Publishers Ltd.
Subject
Organic Chemistry,Computer Science Applications,Drug Discovery,General Medicine