Abstract
Abstract
Background
Transcription factors bind DNA in specific sequence contexts. In addition to distinguishing one nucleobase from another, some transcription factors can distinguish between unmodified and modified bases. Current models of transcription factor binding tend not to take DNA modifications into account, while the recent few that do often have limitations. This makes a comprehensive and accurate profiling of transcription factor affinities difficult.
Results
Here, we develop methods to identify transcription factor binding sites in modified DNA. Our models expand the standard /// DNA alphabet to include cytosine modifications. We develop Cytomod to create modified genomic sequences and we also enhance the MEME Suite, adding the capacity to handle custom alphabets. We adapt the well-established position weight matrix (PWM) model of transcription factor binding affinity to this expanded DNA alphabet. Using these methods, we identify modification-sensitive transcription factor binding motifs. We confirm established binding preferences, such as the preference of ZFP57 and C/EBPβ for methylated motifs and the preference of c-Myc for unmethylated E-box motifs.
Conclusions
Using known binding preferences to tune model parameters, we discover novel modified motifs for a wide array of transcription factors. Finally, we validate our binding preference predictions for OCT4 using cleavage under targets and release using nuclease (CUT&RUN) experiments across conventional, methylation-, and hydroxymethylation-enriched sequences. Our approach readily extends to other DNA modifications. As more genome-wide single-base resolution modification data becomes available, we expect that our method will yield insights into altered transcription factor binding affinities across many different modifications.
Funder
Natural Sciences and Engineering Research Council of Canada
Canadian Institutes of Health Research
Ministry of Training, Colleges and Universities
Canadian Cancer Society
Ontario Ministry of Research and Innovation
Ontario Institute for Cancer Research
Temerty Faculty of Medicine, University of Toronto
Princess Margaret Cancer Foundation
Agencia Nacional de Investigación y Desarrollo
BLUEPRINT Project
Wellcome Trust
Medical Research Council
National Institute of General Medical Sciences
Publisher
Springer Science and Business Media LLC
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献