Abstract
AbstractRecent study revealed that there are thousands of genes that remain unaffected by increased AoXlnR expression, despite the presence of one or more AoXlnR-binding motifs in their promoter region. Given this knowledge, we designed this study to construct several predictive models for determining whether a gene can exhibit a differential response to changes in AoXlnR expression. These models were constructed using 3D DNA shape information determined using the sequence around the AoXlnR binding motifs with classification as functional or nonfunctional. These models were created using a support vector machine followed by the evaluations designed to determine whether these DNA shape-based models can correctly classify functional motifs in terms of area under the receiver operating characteristic curve. The results showed that the differential expression levels of genes located downstream of the AoXlnR motif are closely related to specific DNA shape information around the binding motifs. Furthermore, we found that the parameters contributing to differential expressions differed depending on the number of motifs in the promoter region by comparing the prediction models using regions with only one binding DNA motif and those with multiple binding DNA motifs.Author SummaryDNA-binding transcription factors (TFs) play a central role in transcriptional regulation mechanisms, mainly through their specific binding to target sites on the genome and regulation of the expression of downstream genes. Therefore, a comprehensive analysis of the function of these TFs will lead to the understanding of various biological mechanisms. However, the functions of TFs in vivo are diverse and complicated, and the identified binding sites on the genome are not necessarily involved in the regulation of downstream gene expression. In this study, we investigated whether DNA structural information around the binding site of transcription factors can be used to predict the involvement of the binding site in the regulation of the expression of genes located downstream of the binding site. Specifically, we calculated the structural parameters based on the DNA shape around the DNA binding motif located upstream of the gene whose expression is directly regulated by the transcription factor AoXlnR from Aspergillus oryzae, and showed that the presence or absence of expression regulation can be predicted from the sequence information with high accuracy by machine learning incorporating these parameters.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献