Affiliation:
1. The Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
Abstract
Discrepancies in gene and orphan number indicated by previous analyses suggest thatS. cerevisiaewould benefit from a consistent re-annotation. In this analysis three new genes are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined as totally spurious ORFs which should be disregarded. At least a further 193 genes could be described as very hypothetical, based on a number of criteria. It was found that disparate genes with sequence overlaps over ten amino acids (especially at the N-terminus) are rare in both S.cerevisiaeandSz.pombe. A new S.cerevisiaegene number estimate with an upper limit of 5804 is proposed, but after the removal of very hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be closer to the true upper limit, it is still predicted to be an overestimate of gene number. A complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiaereannotation: ftp://ftp/pub/yeast/SCreannotation).
Subject
Genetics,Molecular Biology,Biotechnology
Reference25 articles.
1. Basic local alignment search tool
2. 1994. A generalized profile syntax for biomolecular sequences motifs and its function in automatic sequence interpretation. In ISMB-94; Proceedings 2nd International Conference on Intelligent Systems for Molecular Biology. AAAIPress; 53-61.
3. The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999
4. Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins
5. Dating the evolutionary radiations of the true fungi
Cited by
67 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献