Cooler: scalable storage for Hi-C data and other genomically-labeled arrays-Reference-Cited by-同舟云学术

Cooler: scalable storage for Hi-C data and other genomically-labeled arrays

Published:2019-02-22 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Abdennur Nezar^ORCID,Mirny Leonid^ORCID

Abstract

Most existing coverage-based (epi)genomic datasets are one-dimensional, but newer technologies probing interactions (physical, genetic, etc.) produce quantitative maps with two-dimensional genomic coordinate systems. Storage and computational costs mount sharply with data resolution when such maps are stored in dense form. Hence, there is a pressing need to develop data storage strategies that handle the full range of useful resolutions in multidimensional genomic datasets by taking advantage of their sparse nature, while supporting efficient compression and providing fast random access to facilitate development of scalable algorithms for data analysis. We developed a file format called cooler, based on a sparse data model, that can support genomically-labeled matrices at any resolution. It has the flexibility to accommodate various descriptions of the data axes (genomic coordinates, tracks and bin annotations), resolutions, data density patterns, and metadata. Cooler is based on HDF5 and is supported by a Python library and command line suite to create, read, inspect and manipulate cooler data collections. The format has been adopted as a standard by the NIH 4D Nucleome Consortium. Cooler is cross-platform, BSD-licensed, and can be installed from the Python Package Index or the bioconda repository. The source code is maintained on Github at https://github.com/mirnylab/cooler.

Publisher

Cold Spring Harbor Laboratory

Reference32 articles.

1. Capturing Chromosome Conformation

2. The second decade of 3C technologies: detailed insights into nuclear organization

3. Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human Genome

4. How best to identify chromosomal interactions: a comparison of approaches;Nature methods,2017

5. The Hitchhiker’s guide to Hi-C analysis: Practical guidelines

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Non-coding variants impact cis-regulatory coordination in a cell type-specific manner;Genome Biology;2024-07-18

2. Mitotic chromosomes scale to nuclear-cytoplasmic ratio and cell size in Xenopus;eLife;2023-04-25

3. Targeted cohesin loading characterizes the entry and exit sites of loop extrusion trajectories;2023-01-04

4. Cooltools: enabling high-resolution Hi-C analysis in Python;2022-11-01

5. Dynamic chromatin organization and regulatory interactions in human endothelial cell differentiation;2022-04-16