Physicochemical models of protein–DNA binding with standard and modified base pairs

Author:

Chiu Tsu-Pei1ORCID,Rao Satyanarayan1ORCID,Rohs Remo1234ORCID

Affiliation:

1. Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089

2. Department of Chemistry, University of Southern California, Los Angeles, CA 90089

3. Department of Physics and Astronomy, University of Southern California, Los Angeles, CA 90089

4. Department of Computer Science, University of Southern California, Los Angeles, CA 90089

Abstract

DNA-binding proteins play important roles in various cellular processes, but the mechanisms by which proteins recognize genomic target sites remain incompletely understood. Functional groups at the edges of the base pairs (bp) exposed in the DNA grooves represent physicochemical signatures. As these signatures enable proteins to form specific contacts between protein residues and bp, their study can provide mechanistic insights into protein–DNA binding. Existing experimental methods, such as X-ray crystallography, can reveal such mechanisms based on physicochemical interactions between proteins and their DNA target sites. However, the low throughput of structural biology methods limits mechanistic insights for selection of many genomic sites. High-throughput binding assays enable prediction of potential target sites by determining relative binding affinities of a protein to massive numbers of DNA sequences. Many currently available computational methods are based on the sequence of standard Watson–Crick bp. They assume that the contribution of overall binding affinity is independent for each base pair, or alternatively include dinucleotides or short k -mers. These methods cannot directly expand to physicochemical contacts, and they are not suitable to apply to DNA modifications or non-Watson–Crick bp. These variations include DNA methylation, and synthetic or mismatched bp. The proposed method, DeepRec, can predict relative binding affinities as function of physicochemical signatures and the effect of DNA methylation or other chemical modifications on binding. Sequence-based modeling methods are in comparison a coarse-grain description and cannot achieve such insights. Our chemistry-based modeling framework provides a path towards understanding genome function at a mechanistic level.

Funder

HHS | NIH | National Institute of General Medical Sciences

Human Frontier Science Program

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Reference53 articles.

1. Origins of Specificity in Protein-DNA Recognition

2. DNAproDB: An expanded database and web-based tool for structural analysis of DNA-protein complexes;Sagendorf J. M.;Nucleic Acids Res.,2020

3. Functional Specificity of a Hox Protein Mediated by the Recognition of Minor Groove Structure

4. Recognition of Specific DNA Sequences

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3