Affiliation:
1. Department of Genetics, Evolution and Environment, University College London , London , United Kingdom
Abstract
Abstract
The CODEML program in the PAML package has been widely used to analyze protein-coding gene sequences to estimate the synonymous and nonsynonymous rates (dS and dN) and to detect positive Darwinian selection driving protein evolution. For users not familiar with molecular evolutionary analysis, the program is known to have a steep learning curve. Here, we provide a step-by-step protocol to illustrate the commonly used tests available in the program, including the branch models, the site models, and the branch-site models, which can be used to detect positive selection driving adaptive protein evolution affecting particular lineages of the species phylogeny, affecting a subset of amino acid residues in the protein, and affecting a subset of sites along prespecified lineages, respectively. A data set of the myxovirus (Mx) genes from ten mammal and two bird species is used as an example. We discuss a new feature in CODEML that allows users to perform positive selection tests for multiple genes for the same set of taxa, as is common in modern genome-sequencing projects. The PAML package is distributed at https://github.com/abacus-gene/paml under the GNU license, with support provided at its discussion site (https://groups.google.com/g/pamlsoftware). Data files used in this protocol are available at https://github.com/abacus-gene/paml-tutorial.
Funder
Biotechnological and Biological Sciences Research Council
Publisher
Oxford University Press (OUP)
Subject
Genetics,Molecular Biology,Ecology, Evolution, Behavior and Systematics
Reference60 articles.
1. Investigating protein-coding sequence evolution with probabilistic codon substitution models;Anisimova;Mol Biol Evol,2009
2. Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites;Anisimova;Genetics,2003
3. Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites;Anisimova;Mol Biol Evol,2007
4. Controlling the false discovery rate: a practical and powerful approach to multiple testing;Benjamini;J R Stat Soc B,1995
5. On the adaptive control of the false discovery rate in multiple testing with independent statistics;Benjamini;J Educat Behav Stat,2000
Cited by
51 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献