Authors
Grishin Nick V., Wolf Yuri I., Koonin Eugene V.
Abstract
Accumulation of complete genome sequences of diverse organisms creates new possibilities for evolutionary inferences from whole-genome comparisons. In the present study, we analyze the distributions of substitution rates among proteins encoded in 19 complete genomes (the interprotein rate distribution). To estimate these rates, it is necessary to employ another fundamental distribution, that of the substitution rates among sites in proteins (the intraprotein distribution). Using two independent approaches, we show that intraprotein substitution rate variability appears to be significantly greater than generally accepted. This yields more realistic estimates of evolutionary distances from amino-acid sequences, which is critical for evolutionary-tree construction. We demonstrate that the interprotein rate distributions inferred from the genome-to-genome comparisons are similar to each other and can be approximated by a single distribution with a long exponential shoulder. This suggests that a generalized version of the molecular clock hypothesis may be valid on a genome scale. We also use the scaling parameter of the obtained interprotein rate distribution to construct a rooted whole-genome phylogeny. The topology of the resulting tree is largely compatible with those of global rRNA-based trees and trees produced by other approaches to genome-wide comparison.
Publisher
Cold Spring Harbor Laboratory
Subject
Genetics (clinical), Genetics
Cited by
69 articles.