Author:
Shooshtari Parisa,Feng Samantha,Nelakuditi Viswateja,Asakereh Reza,Hosseini Naghavi Nader,Foong Justin,Brudno Michael,Cotsapas Chris
Abstract
AbstractInternational consortia, including ENCODE, Roadmap Epigenomics, Genomics of Gene Regulation and Blueprint Epigenome have made large-scale datasets of open chromatin regions publicly available. While these datasets are extremely useful for studying mechanisms of gene regulation in disease and cell development, they only identify open chromatin regions in individual samples. A uniform comparison of accessibility of the same regulatory sites across multiple samples is necessary to correlate open chromatin accessibility and expression of target genes across matched cell types. Additionally, although replicate samples are available for majority of cell types, a comprehensive replication-based quality checking of individual regulatory sites is still lacking. We have integrated 828 DNase-I hypersensitive sequencing samples, which we have uniformly processed and then clustered their regulatory regions across all samples. We checked the quality of open-chromatin regions using our replication test. This has resulted in a comprehensive, quality-checked database of Open CHROmatin (OCHROdb) regions for 194 unique human cell types and cell lines which can serve as a reference for gene regulatory studies involving open chromatin. We have made this resource publicly available: users can download the whole database, or query it for their genomic regions of interest and visualize the results in an interactive genome browser.
Funder
Children's Health Research Institute
Natural Sciences and Engineering Research Council of Canada
Ontario Institute for Cancer Research
Schulich School of Medicine and Dentistry, Western University
Genome Canada
Publisher
Springer Science and Business Media LLC