Author:
Corcoran David L.,Feingold Eleanor,Dominick Jessica,Wright Marietta,Harnaha Jo,Trucco Massimo,Giannoukakis Nick,Benos Panayiotis V.
Abstract
The search for mammalian DNA regulatory regions poses a challenging problem in computational biology. The short length of the DNA patterns compared with the size of the promoter regions and the degeneracy of the patterns makes their identification difficult. One way to overcome this problem is to use evolutionary information to reduce the number of false-positive predictions. We developed a novel method for pattern identification that compares a pair of putative binding sites in two species (e.g., human and mouse) and assigns two probability scores based on the relative position of the sites in the promoter and their agreement with a known model of binding preferences. We tested the algorithm's ability to predict known binding sites on various promoters. Overall, it exhibited 83% sensitivity and the specificity was 72%, which is a clear improvement over existing methods. Our algorithm also successfully predicted two novel NF-κB binding sites in the promoter region of the mouse autotaxin gene (ATX, ENPP2), which we were able to verify by using chromatin immunoprecipitation assay coupled with quantitative real-time PCR.
Publisher
Cold Spring Harbor Laboratory
Subject
Genetics(clinical),Genetics
Reference39 articles.
1. Barash, Y., Elidan, G., Friedman, N., and Kaplan, T. 2003. Modeling dependencies in protein-DNA binding sites. In Seventh Annual International Conference on Computational Molecular Biology (RECOMB).
2. Benos, P.V., Lapedes, A.S., Fields, D.S., and Stormo, G.D. 2001. SAMIE: Statistical algorithm for modeling interaction energies. Pac. Symp. Biocomput. 115-126.
3. Additivity in protein-DNA interactions: how good an approximation is it?
4. Probabilistic Code for DNA Recognition by Proteins of the EGR Family
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献