Quantitative model for genome-wide cyclic AMP receptor protein binding site identification and characteristic analysis

Author:

Chen Yigang12,Lin Yang-Chi-Dung12,Luo Yijun1,Cai Xiaoxuan12,Qiu Peng1,Cui Shidong12,Wang Zhe3,Huang Hsi-Yuan12,Huang Hsien-Da12

Affiliation:

1. School of Medicine, Life and Health Sciences, The Chinese University of Hong Kong , Shenzhen, Longgang District, Shenzhen, Guangdong Province 518172 , China

2. Warshel Institute for Computational Biology, School of Medicine, Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen , Longgang District, Shenzhen, Guangdong Province 518172 , China

3. School of Humanities and Social Science, The Chinese University of Hong Kong, Shenzhen , Longgang District, Shenzhen, Guangdong Province 518172 , China

Abstract

Abstract Cyclic AMP receptor proteins (CRPs) are important transcription regulators in many species. The prediction of CRP-binding sites was mainly based on position-weighted matrixes (PWMs). Traditional prediction methods only considered known binding motifs, and their ability to discover inflexible binding patterns was limited. Thus, a novel CRP-binding site prediction model called CRPBSFinder was developed in this research, which combined the hidden Markov model, knowledge-based PWMs and structure-based binding affinity matrixes. We trained this model using validated CRP-binding data from Escherichia coli and evaluated it with computational and experimental methods. The result shows that the model not only can provide higher prediction performance than a classic method but also quantitatively indicates the binding affinity of transcription factor binding sites by prediction scores. The prediction result included not only the most knowns regulated genes but also 1089 novel CRP-regulated genes. The major regulatory roles of CRPs were divided into four classes: carbohydrate metabolism, organic acid metabolism, nitrogen compound metabolism and cellular transport. Several novel functions were also discovered, including heterocycle metabolic and response to stimulus. Based on the functional similarity of homologous CRPs, we applied the model to 35 other species. The prediction tool and the prediction results are online and are available at: https://awi.cuhk.edu.cn/∼CRPBSFinder.

Funder

National Natural Science Foundation of China

Shenzhen Science and Technology Program

Key Program of Guangdong Basic and Applied Basic Research Fund

Shenzhen City and Longgang District for the Warshel Institute for Computational Biology

Ganghong Young Scholar Development Fund

Undergraduate Research Award

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Reference54 articles.

1. Mechanism of activation of catabolite-sensitive genes: a positive control system;Zubay;Proc Natl Acad Sci USA,1970

2. Structure of catabolite gene activator protein at 2.9 a resolution suggests binding to left-handed B-DNA;McKay;Nature,1981

3. Phylogeny of the bacterial superfamily of Crp-Fnr transcription regulators: exploiting the metabolic spectrum by controlling alternative gene programs;Körner;FEMS Microbiol Rev,2003

4. Allosteric regulation of the cAMP receptor protein;Harman;Biochim Biophys Acta,2001

5. cAMP receptor protein from escherichia coli as a model of signal transduction in proteins--a review;Fic;J Mol Microbiol Biotechnol,2009

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3