Affiliation:
1. Department of Computer, Jing-De-Zhen Ceramic university, 333403, Jing-De-Zhen, China
Abstract
Abstract:
Promoters are DNA fragments located near the transcription initiation site, they can be divided
into strong promoter type and weak promoter type according to transcriptional activation and
expression level. Identifying promoters and their strengths in DNA sequences is essential for understanding
gene expression regulation. Therefore, it is crucial to further improve predictive quality of
predictors for real-world application requirements. Here, we constructed the latest training dataset
based on the RegalonDB website, where all the promoters in this dataset have been experimentally
validated, and their sequence similarity is less than 85%. We used one-hot and nucleotide chemical
property and density (NCPD) to represent DNA sequence samples. Additionally, we proposed an ensemble
deep learning framework containing a multi-head attention module, long short-term memory
present, and a convolutional neural network module.
The results showed that iPSI(2L)-EDL outperformed other existing methods for both promoter prediction
and identification of strong promoter type and weak promoter type, the AUC and MCC for
the iPSI(2L)-EDL in identifying promoter were improved by 2.23% and 2.96% compared to that of
PseDNC-DL on independent testing data, respectively, while the AUC and MCC for the iPSI(2L)-
EDL were increased by 3.74% and 5.86% in predicting promoter strength type, respectively. The results
of ablation experiments indicate that CNN plays a crucial role in recognizing promoters, the importance
of different input positions and long-range dependency relationships among features are
helpful for recognizing promoters.
Furthermore, to make it easier for most experimental scientists to get the results they need, a userfriendly
web server has been established and can be accessed at http://47.94.248.117/IPSW(2L)-EDL.
Publisher
Bentham Science Publishers Ltd.
Subject
Computational Mathematics,Genetics,Molecular Biology,Biochemistry
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献