Abstract
To date, basic visualization of sequence alignments has largely focused on displaying per-site columns of nucleotide or amino acid residues, along with associated frequency summarizations. The persistence of this tendency in recent tools designed for viewing mapped read data indicates that such a perspective not only provides a reliable visualization of per-site alterations, but also offers implicit reassurance to the end-user about data accessibility. However, the initial insight gained is limited, especially when viewing alignments consisting of many sequences that represent differing factors such as location, date and subtype. A basic alignment viewer has the potential to increase initial insight through visual enhancement without delving into the realms of complex sequence analysis. We present CView, a visualizer that expands on the per-site representation of residues by incorporating a dynamic network based on a summarization of the diversity present across different regions of the alignment. Within the network, nodes are based on the clustering of sequence fragments that span windows placed consecutively along the alignment. Edges are placed between nodes of neighbouring windows where they share sequence identifier(s), i.e. different regions of the same sequence(s). Thus, if a node is selected on the network, the relationship that sequences passing through that node have to other regions of diversity within the alignment can be observed through path tracing. In addition to augmenting visual insight, CView provides export features including variant summarization, per-site residue and k-mer frequencies, consensus sequence, alignment dissection and clustering, each useful across a range of research areas. The software has been designed to be user-friendly, intuitive and interactive.
It is open source and an executable jar, source code, quick start, usage tutorial and test data are available (under the GNU General Public License) from https://sourceforge.net/projects/cview/.
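The window-based network described above can be illustrated with a minimal Python sketch. It is not CView's implementation: the function names `build_window_network` and `trace_paths` are hypothetical, windows are assumed to be non-overlapping, and fragments are clustered by exact identity only, since the abstract does not state which clustering method or window placement CView uses.

```python
from collections import defaultdict


def build_window_network(alignment, window):
    """Build a window network from {sequence id: aligned sequence}.

    Assumption: fragments are grouped by exact identity, which is the
    simplest possible clustering and may differ from what CView does.
    """
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("aligned sequences must all have the same length")
    length = lengths.pop()

    # nodes[w] maps each distinct fragment in window w to the set of
    # sequence ids that carry that fragment.
    nodes = []
    for start in range(0, length, window):
        clusters = defaultdict(set)
        for seq_id, seq in alignment.items():
            clusters[seq[start:start + window]].add(seq_id)
        nodes.append(dict(clusters))

    # An edge joins two nodes of neighbouring windows when they share at
    # least one sequence id; the weight is the number of shared ids.
    edges = []
    for w in range(len(nodes) - 1):
        for frag_a, ids_a in nodes[w].items():
            for frag_b, ids_b in nodes[w + 1].items():
                shared = ids_a & ids_b
                if shared:
                    edges.append(((w, frag_a), (w + 1, frag_b), len(shared)))
    return nodes, edges


def trace_paths(nodes, selected):
    """Return every node visited by the sequences in the selected node."""
    window_index, fragment = selected
    ids = nodes[window_index][fragment]
    return [
        (w, frag)
        for w, clusters in enumerate(nodes)
        for frag, members in clusters.items()
        if members & ids
    ]


# Toy alignment, invented for illustration only.
toy = {"s1": "ACGTAC", "s2": "ACGTTT", "s3": "TTGTAC"}
nodes, edges = build_window_network(toy, window=3)
print(trace_paths(nodes, (0, "TTG")))  # [(0, 'TTG'), (1, 'TAC')]
```

Selecting the node `(0, "TTG")` traces only `s3`, which reaches `(1, "TAC")` in the next window. This mirrors the path tracing described in the abstract, under the simplifying assumptions stated above.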
Funder
Fundação para a Ciência e a Tecnologia
European Regional Development Fund
Publisher
Public Library of Science (PLoS)