Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants

Author:

Filipovic David123ORCID,Qi Wenjie123,Kana Omar345,Marri Daniel13,LeCluyse Edward L6,Andersen Melvin E7,Cuddapah Suresh8,Bhattacharya Sudin13459ORCID

Affiliation:

1. Department of Biomedical Engineering, Michigan State University , East Lansing, Michigan 48824, USA

2. Department of Computational Mathematics, Science and Engineering, Michigan State University , East Lansing, Michigan 48824, USA

3. Institute for Quantitative Health Science & Engineering, Michigan State University , East Lansing, Michigan 48824, USA

4. Department of Pharmacology & Toxicology, Michigan State University , East Lansing, Michigan 48824, USA

5. Institute for Integrative Toxicology, Michigan State University , East Lansing, Michigan 48824, USA

6. LifeSciences Division, LifeNet Health, Research Triangle Park , North Carolina 27709, USA

7. ScitoVation LLC , Durham, North Carolina 27713, USA

8. Division of Environmental Medicine, Department of Medicine, New York University School of Medicine , New York, New York 10010, USA

9. Center for Research on Ingredient Safety, Michigan State University , East Lansing, Michigan 48824, USA

Abstract

Abstract The aryl hydrocarbon receptor (AhR) is an inducible transcription factor whose ligands include the potent environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Ligand-activated AhR binds to DNA at dioxin response elements (DREs) containing the core motif 5′-GCGTG-3′. However, AhR binding is highly tissue specific. Most DREs in accessible chromatin are not bound by TCDD-activated AhR, and DREs accessible in multiple tissues can be bound in some and unbound in others. As such, AhR functions similarly to many nuclear receptors. Given that AhR possesses a strong core motif, it is suited for a motif-centered analysis of its binding. We developed interpretable machine learning models predicting the AhR binding status of DREs in MCF-7, GM17212, and HepG2 cells, as well as primary human hepatocytes. Cross-tissue models predicting transcription factor (TF)-DNA binding generally perform poorly. However, reasons for the low performance remain unexplored. By interpreting the results of individual within-tissue models and by examining the features leading to low cross-tissue performance, we identified sequence and chromatin context patterns correlated with AhR binding. We conclude that AhR binding is driven by a complex interplay of tissue-agnostic DRE flanking DNA sequence and tissue-specific local chromatin context. Additionally, we demonstrate that interpretable machine learning models can provide novel and experimentally testable mechanistic insights into DNA binding by inducible TFs.

Funder

United States Drug Administration

National Institute of Food and Agriculture

Michigan AgBioResearch

National Institute of Environmental Health Sciences

National Institutes of Health

Publisher

Oxford University Press (OUP)

Subject

Toxicology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3