Abstract
AbstractThe molecular clock and its phylogenetic applications to genomic data have changed how we study and understand one of the major human pathogens, Mycobacterium tuberculosis (MTB), the causal agent of tuberculosis. Genome sequences of MTB strains sampled at different times are increasingly used to infer when a particular outbreak begun, when a drug resistant clone appeared and expanded, or when a strain was introduced into a specific region. Despite the growing importance of the molecular clock in tuberculosis research, there is a lack of consensus as to whether MTB displays a clocklike behavior and about its rate of evolution. Here we performed a systematic study of the MTB molecular clock on a large genomic data set (6,285 strains), covering most of the global MTB diversity and representing different epidemiological settings. We found wide variation in the degree of clocklike structure among data sets, indicating that sampling times are sometimes insufficient to calibrate the clock of MTB. For data sets with temporal structure, we found that MTB genomes accumulate between 1×10−8 and 5×10−7 nucleotide changes per-site-per-year, which corresponds to 0.04 – 2.2 SNPs per-genome-per-year. Contrary to what expected, these estimates were not dependent on the time of the calibration points as they did not change significantly when we used epidemiological isolates (sampled in the last 40 years) or ancient DNA samples (about 1,000 years old) to calibrate the tree. Additionally, the uncertainty and the discrepancies in the results of different methods were often large, highlighting the importance of using different methods, and of considering carefully their assumptions and limitations.Significance StatementOne of the major recent advancement in evolutionary biology is the development of statistical methods to infer the past evolutionary history of species and populations with genomic data. In the last five years, many researchers have used the molecular clock to study the evolution of Mycobacterium tuberculosis, a bacterial pathogen that causes tuberculosis and is responsible for millions of human deaths every year. The application of the molecular clock to tuberculosis is extremely useful to understand the evolution of drug resistance, the spread of different strains and the origin of the disease. Since some of these studies found contrasting results, we performed a systematic analysis of the molecular clock of MTB. This study will provide an important guideline for future analyses of tuberculosis and other organisms.
Publisher
Cold Spring Harbor Laboratory
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献