Affiliation:
1. Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602-7229
Abstract
ABSTRACT
The original genome annotation of the hyperthermophilic archaeon
Pyrococcus furiosus
contained 2,065 open reading frames (ORFs). The genome was subsequently automatically annotated in two public databases by the Institute for Genomic Research (TIGR) and the National Center for Biotechnology Information (NCBI). Remarkably, more than 500 of the originally annotated ORFs differ in size in the two databases, many very significantly. For example, more than 170 of the predicted proteins differ at their N termini by more than 25 amino acids. Similar discrepancies were observed in the TIGR and NCBI databases with the other archaeal and bacterial genomes examined. In addition, the two databases contain 60 (NCBI) and 221 (TIGR) ORFs not present in the original annotation of
P. furiosus
. In the present study we have experimentally assessed the validity of 88 previously unannotated ORFs. Transcriptional analyses showed that 11 of 61 ORFs examined were expressed in
P. furiosus
when grown at either 95 or 72°C. In addition, 7 of 54 ORFs examined yielded heat-stable recombinant proteins when they were expressed in
Escherichia coli
, although only one of the seven ORFs was expressed in
P. furiosus
under the growth conditions tested. It is concluded that the
P. furiosus
genome contains at least 17 ORFs not previously recognized in the original annotation. This study serves to highlight the discrepancies in the public databases and the problems of accurately defining the number and sizes of ORFs within any microbial genome.
Publisher
American Society for Microbiology
Subject
Molecular Biology,Microbiology
Cited by
39 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献