Abstract
Recent genomic analyses of Archaea have profoundly reshaped our understanding of their distribution, functions and roles in eukaryotic evolution. Within the domain, the major supergroups are the Euryarchaeota, which includes many methanogens; the TACK, which includes Thaumarchaeota that contribute to ammonia oxidation in soils and the ocean; the Asgard, which includes lineages inferred to be ancestral to eukaryotes; and the DPANN, a group of mostly symbiotic, small-celled archaea. Here, we investigated the extent to which clustering based on protein family content recapitulates archaeal phylogeny, and we identified the proteins that distinguish the major subdivisions. We also defined 10,866 archaeal protein families that will serve as a community resource. Clustering based on these families broadly recovers the archaeal phylogenetic tree. Interestingly, all major groups are distinguished primarily by the presence of families of conserved hypothetical proteins that are either novel or so highly diverged that their functions are obscured. Given that these hypothetical proteins are nearly ubiquitous within phyla, we conclude that they were important in the origin of most of the major archaeal lineages.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.