Affiliations:
1. Institute for Advanced Study, Einstein Dr., Princeton, New Jersey 08540
2. Computational Biology Group, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N, Seattle, Washington 98109
Abstract
In the last few years, genomic sequence data have become publicly available for thousands of influenza A virus strains, including the 1918 pandemic strain, and for hundreds of isolates of the avian influenza virus H5N1, which is causing an increasing number of human fatalities. This large quantity of sequence data allows us to perform comparative genomics on the human and avian versions of the virus. We find that the nucleotide compositions of influenza A viruses infecting the two hosts are sufficiently different that we can determine the host with almost 100% accuracy. This assignment works at the segment level, which allows us to construct the reassortment history of individual segments within each strain. We suggest that the different nucleotide compositions can be explained by a host-dependent mutation bias. To support this idea, we estimate the fixation rates for the different polymerase segments and the ratios of synonymous to nonsynonymous changes. Additionally, we provide evidence supporting the hypothesis that the H1N1 influenza virus entered the human population just prior to the 1918 outbreak, with an earliest bound of 1910.
Publisher
American Society for Microbiology
Subject
Virology, Insect Science, Immunology, Microbiology
Cited by
105 articles.