Tight basis cycle representatives for persistent homology of large biological data sets

Published:2023-05-30 Issue:5 Volume:19 Page:e1010341
ISSN:1553-7358
Container-title:PLOS Computational Biology
language:en
Short-container-title:PLoS Comput Biol

Author:

Aggarwal Manu, Periwal Vipul

Abstract

Persistent homology (PH) is a popular tool for topological data analysis that has found applications across diverse areas of research. It provides a rigorous method to compute robust topological features in discrete experimental observations that often contain various sources of uncertainties. Although powerful in theory, PH suffers from high computation cost that precludes its application to large data sets. Additionally, most analyses using PH are limited to computing the existence of nontrivial features. Precise localization of these features is not generally attempted because, by definition, localized representations are not unique and because of even higher computation cost. Such a precise location is a sine qua non for determining functional significance, especially in biological applications. Here, we provide a strategy and algorithms to compute tight representative boundaries around nontrivial robust features in large data sets. To showcase the efficiency of our algorithms and the precision of computed boundaries, we analyze the human genome and protein crystal structures. In the human genome, we found a surprising effect of the impairment of chromatin loop formation on loops through chromosome 13 and the sex chromosomes. We also found loops with long-range interactions between functionally related genes. In protein homologs with significantly different topology, we found voids attributable to ligand-interaction, mutation, and differences between species.

Funder

National Institute of Diabetes and Digestive and Kidney Diseases

Publisher

Public Library of Science (PLoS)

Subject

Computational Theory and Mathematics; Cellular and Molecular Neuroscience; Genetics; Molecular Biology; Ecology; Modeling and Simulation; Ecology, Evolution, Behavior and Systematics

Reference 42 articles.

1. Uncertainty Modeling and Analysis in Engineering and the Sciences

2. Chromatin loops in gene regulation;S Kadauke;Biochimica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms,2009

3. Organizational principles of 3D genome architecture;MJ Rowley;Nature Reviews Genetics,2018

4. Persistent homology analysis of brain artery trees;P Bendich;The Annals of Applied Statistics,2016

5. Topological data analysis of zebrafish patterns;MR McGuirl;Proceedings of the National Academy of Sciences,2020

Cited by 1 article.

1. Geometric and topological characterization of the cytoarchitecture of islets of Langerhans;PLOS Computational Biology;2023-11-09