Affiliation:
1. Department of Mathematics and Statistics, University of Ottawa, Ottawa, Canada K1N 6N5
Abstract
Whole-genome doubling, tripling or replicating to a greater degree, due to fixation of polyploidization events, is attested in almost all lineages of the flowering plants, recurring in the ancestry of some plants two, three or more times in retracing their history to the earliest angiosperm. This major mechanism in plant genome evolution, which generally appears as instantaneous on the evolutionary time scale, sets in operation a compensatory process called fractionation, the loss of duplicate genes, initially rapid, but continuing at a diminishing rate over millions and tens of millions of years. We study this process by statistically comparing the distribution of duplicate gene pairs as a function of their time of creation through polyploidization, as measured by sequence similarity. The stochastic model that accounts for this distribution, though exceedingly simple, still has too many parameters to be estimated based only on the similarity distribution, while the computational procedures for compiling the distribution from annotated genomic data is heavily biased against earlier polyploidization events—syntenic ‘crumble’. Other parameters, such as the size of the initial gene complement and the ploidy of the various events giving rise to duplicate gene pairs, are even more inaccessible to estimation. Here, we show how the frequency of
unpaired
genes, identified via their embedding in stretches of duplicate pairs, together with previously established constraints among some parameters, adds enormously to the range of successive polyploidization events that can be analysed. This also allows us to estimate the initial gene complement and to correct for the bias due to crumble. We explore the applicability of our methodology to four flowering plant genomes covering a range of different polyploidization histories.
Funder
Canada Research Chairs
University of Ottawa
Natural Sciences and Engineering Research Council of Canada
Subject
Biomedical Engineering,Biomaterials,Biochemistry,Bioengineering,Biophysics,Biotechnology
Reference25 articles.
1. Zhang Y Sankoff D. 2017 The similarity distribution of paralogous gene pairs created by recurrent alternation of polyploidization and fractionation. In Comparative genomics. RECOMB-CG 2017 (eds J Meidanis L Nakhleh). Lecture Notes in Computer Science vol. 10562 pp. 1–13. (doi:10.1007/978-3-319-67979-2-1)
2. Pinning down ploidy in paleopolyploid plants
3. Models for Similarity Distributions of Syntenic Homologs and Applications to Phylogenomics
4. A branching process for homology distribution-based inference of polyploidy, speciation and loss
5. Branching out to speciation in a model of fractionation: the Malvaceae;Zhang Y;IEEE/ACM Trans. Comput. Biol. Bioinform.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献