Author:
Almeida João R., Pinho Armando J., Oliveira José L., Fajarda Olga, Pratas Diogo
Abstract
Summary: Next-generation sequencing triggered the production of a massive volume of publicly available data and the development of new specialised tools. These tools are dispersed over different frameworks, which makes managing and analysing the data a challenging task. Additionally, new targeted tools are needed, given the dynamics and specificities of the field. We present GTO, a comprehensive toolkit designed to unify pipelines in genomic and proteomic research, which combines specialised tools for analysis, simulation, compression, development, visualisation, and transformation of the data. The toolkit combines novel tools with a modular architecture, making it an excellent platform for experimental scientists as well as a useful resource for teaching bioinformatics inquiry to students in the life sciences.
Availability and implementation: GTO is implemented in C and is available, under the MIT license, at http://bioinformatics.ua.pt/gto.
Contact: pratas@ua.pt
Supplementary information: Supplementary data are available at the publisher's Web site.
Publisher
Cold Spring Harbor Laboratory