CodAn: predictive models for precise identification of coding regions in eukaryotic transcripts-Reference-Cited by-同舟云学术

CodAn: predictive models for precise identification of coding regions in eukaryotic transcripts

Published:2020-05-27 Issue:3 Volume:22 Page:
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en
Short-container-title:

Author:

Nachtigall Pedro G,Kashiwabara Andre Y,Durham Alan M

Abstract

Abstract Motivation Characterization of the coding sequences (CDSs) is an essential step in transcriptome annotation. Incorrect identification of CDSs can lead to the prediction of non-existent proteins that can eventually compromise knowledge if databases are populated with similar incorrect predictions made in different genomes. Also, the correct identification of CDSs is important for the characterization of the untranslated regions (UTRs), which are known to be important regulators of the mRNA translation process. Considering this, we present CodAn (Coding sequence Annotator), a new approach to predict confident CDS and UTR regions in full or partial transcriptome sequences in eukaryote species. Results Our analysis revealed that CodAn performs confident predictions on full-length and partial transcripts with the strand sense of the CDS known or unknown. The comparative analysis showed that CodAn presents better overall performance than other approaches, mainly when considering the correct identification of the full CDS (i.e. correct identification of the start and stop codons). In this sense, CodAn is the best tool to be used in projects involving transcriptomic data. Availability CodAn is freely available at https://github.com/pedronachtigall/CodAn. Contact aland@usp.br Supplementary information Supplementary data are available at Briefings in Bioinformatics online.

Funder

Fundação de Amparo à Pesquisa do Estado de São Paulo

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

Conselho Nacional de Pesquisa

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Link

http://academic.oup.com/bib/article-pdf/22/3/bbaa045/37963033/bbaa045.pdf

Reference41 articles.

1. Regulation of eukaryotic gene expression by the untranslated gene regions and other non–coding elements;Lucy;Cell Mol Life Sc,2012