Structure-based learning to predict and model protein–DNA interactions and transcription-factor co-operativity in cis-regulatory elements

Author:

Fornes Oriol1,Meseguer Alberto2,Aguirre-Plans Joachim3,Gohl Patrick2,Bota Patricia M2,Molina-Fernández Ruben2,Bonet Jaume24,Chinchilla-Hernandez Altair5,Pegenaute Ferran5,Gallego Oriol5ORCID,Fernandez-Fuentes Narcis6,Oliva Baldo2ORCID

Affiliation:

1. Centre for Molecular Medicine and Therapeutics. BC Children's Hospital Research Institute. Department of Medical Genetics. University of British Columbia , Vancouver , BC  V5Z 4H4 , Canada

2. Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra , Barcelona 08005  Catalonia , Spain

3. Center for Complex Network Research. Northeastern University , Boston , MA  02115 , USA

4. Laboratory of Protein Design & Immunoengineering. School of Engineering. Ecole Polytechnique Federale de Lausanne. Lausanne 1015 , Vaud , Switzerland

5. Live-Cell Structural Biology. Department of Medicine and Life Sciences, Universitat Pompeu Fabra , Barcelona 08005  Catalonia , Spain

6. Institute of Biological, Environmental and Rural Science. Aberystwyth University , SY23 3DA Aberystwyth , UK

Abstract

Abstract Transcription factor (TF) binding is a key component of genomic regulation. There are numerous high-throughput experimental methods to characterize TF–DNA binding specificities. Their application, however, is both laborious and expensive, which makes profiling all TFs challenging. For instance, the binding preferences of ∼25% human TFs remain unknown; they neither have been determined experimentally nor inferred computationally. We introduce a structure-based learning approach to predict the binding preferences of TFs and the automated modelling of TF regulatory complexes. We show the advantage of using our approach over the classical nearest-neighbor prediction in the limits of remote homology. Starting from a TF sequence or structure, we predict binding preferences in the form of motifs that are then used to scan a DNA sequence for occurrences. The best matches are either profiled with a binding score or collected for their subsequent modeling into a higher-order regulatory complex with DNA. Co-operativity is modelled by: (i) the co-localization of TFs and (ii) the structural modeling of protein–protein interactions between TFs and with co-factors. We have applied our approach to automatically model the interferon-β enhanceosome and the pioneering complexes of OCT4, SOX2 (or SOX11) and KLF4 with a nucleosome, which are compared with the experimentally known structures.

Funder

HFSP

MCIN

Agencia Estatal de Investigación

Generalitat de Catalunya

Publisher

Oxford University Press (OUP)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3