Author:
Avni Eliran,Montoya Dennis,Lopez David,Modlin Robert,Pellegrini Matteo,Snir Sagi
Abstract
AbstractBackgroundPseudogenes are non-functional sequences in the genome with homologous sequences that are functional (i.e. genes). They are abundant in eukaryotes where they have been extensively investigated, while in prokaryotes they are significantly scarcer and less well studied. Here we conduct a comprehensive analysis of the evolution of pseudogenes in prokaryotes. For this analysis we consider a broad range of bacteria, but also focus on the leprosy pathogen Mycobacterium leprae, which contains an unusually large number of pseudogenes which comprise approximately 40% of its entire genome.ResultsWe have developed an informatics-based approach to characterize the evolution of pseudogenes. This approach combines tools from phylogenomics, genomics, and transcriptomics. The results we obtain suggest the presence of two mechanisms for pseudogene formation: failed horizontal gene transfer events and disruption of native genes.ConclusionsWe conclude that while in most bacteria the former is most likely responsible for the majority of pseudogenization events, in mycobacteria, and in particular in M. leprae with its exceptionally high pseudogene numbers, the latter predominates. We believe that our study sheds new light on the evolution of pseudogenes in bacteria, by utilizing new methodologies that are applied to the unusually abundant M. leprae pseudogenes and their orthologs. As such, we anticipate that it will be of broad interest to both evolutionary biologists as well as microbiologists.
Publisher
Cold Spring Harbor Laboratory