Abstract
AbstractStrictly ultrametric matrices appear in many domains of mathematics and science; nevertheless, they can be large and dense, making them difficult to store and manipulate, unlike large but sparse matrices. In this manuscript, we exploit that strictly ultrametric matrices can be represented as binary trees to sparsify them via an orthonormal base change based on Haar-like wavelets. We show that, with overwhelmingly high probability, only an asymptotically negligible fraction of the off-diagonal entries in random but large strictly ultrametric matrices remain non-zero after the base change; and develop an algorithm to sparsify such matrices directly from their tree representation. We also identify the subclass of matrices diagonalized by the Haar-like wavelets and supply a sufficient condition to approximate the spectrum of strictly ultrametric matrices outside this subclass. Our methods give computational access to the covariance matrix of the microbiologists’ Tree of Life, which was previously inaccessible due to its size, and motivate introducing a new wavelet-based (beta-diversity) metric to compare microbial environments. Unlike the established (beta-diversity) metrics, the new metric may be used to identify internal nodes (i.e., splits) in the Tree that link microbial composition and environmental factors in a statistically significant manner.MSC codes05C05, 15A18, 42C40, 65F55, 92C70
Publisher
Cold Spring Harbor Laboratory
Reference62 articles.
1. A Rambaut , Figtree v1.3.1. institute of evolutionary biology, university of edinburgh, edinburgh, 2010.
2. D. J. Aldous , Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today, Statistical Science, (2001), pp. 23–34.
3. On statistical tests of phylogenetic tree imbalance: The Sackin and other indices revisited
4. Which Random Processes Describe the Tree of Life? A Large-Scale Study of Phylogenetic Tree Imbalance
5. I. Borg and P. J. Groenen , Modern Multidimensional Scaling: Theory and Applications, Springer, 2nd ed., 2005.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献