Abstract
AbstractThe majority of disease-associated variants identified through genome-wide association studies (GWAS) are located outside of protein-coding regions and are overrepresented in sequences that regulate gene expression. Prioritizing candidate regulatory variants and potential biological mechanisms for further functional experiments, such as genome editing, can be challenging, especially in regions with a high number of variants in strong linkage disequilibrium or multiple proximal gene targets. Improved annotation of the regulatory genome can help identify promising variants and target genes for functional genomics experiments. To advance this area, we developed FORGEdb (https://forge2.altiusinstitute.org/files/forgedb.html), a web-based tool that can rapidly integrate data for individual genetic variants, providing information on associated regulatory elements, transcription factor (TF) binding sites and target genes for over 37 million variants. FORGEdb uses annotations derived from data across a wide range of biological samples to delineate the regulatory context for each variant at the cell type level. Multiple data types, such as Combined Annotation Dependent Depletion (CADD) scores, expression quantitative trait loci (eQTLs), activity-by-contact (ABC) interactions, Contextual Analysis of TF Occupancy (CATO) scores, transcription factor (TF) motifs, DNase I hotspots, histone mark ChIP-seq peaks and chromatin states, are included in FORGEdb and these annotations are integrated into a FORGEdb score to guide assessment of functional importance. In summary, FORGEdb provides an expansive and unique resource of genomic annotations and an integrated score that can be used to accelerate the translation of identified genetic loci into biological insight.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献