Author:
Niu Kun,Luo Ximei,Zhang Shumei,Teng Zhixia,Zhang Tianjiao,Zhao Yuming
Abstract
Enhancers are regulatory DNA sequences that could be bound by specific proteins named transcription factors (TFs). The interactions between enhancers and TFs regulate specific genes by increasing the target gene expression. Therefore, enhancer identification and classification have been a critical issue in the enhancer field. Unfortunately, so far there has been a lack of suitable methods to identify enhancers. Previous research has mainly focused on the features of the enhancer’s function and interactions, which ignores the sequence information. As we know, the recurrent neural network (RNN) and long short-term memory (LSTM) models are currently the most common methods for processing time series data. LSTM is more suitable than RNN to address the DNA sequence. In this paper, we take the advantages of LSTM to build a method named iEnhancer-EBLSTM to identify enhancers. iEnhancer-ensembles of bidirectional LSTM (EBLSTM) consists of two steps. In the first step, we extract subsequences by sliding a 3-mer window along the DNA sequence as features. Second, EBLSTM model is used to identify enhancers from the candidate input sequences. We use the dataset from the study of Quang H et al. as the benchmarks. The experimental results from the datasets demonstrate the efficiency of our proposed model.
Subject
Genetics (clinical),Genetics,Molecular Medicine
Reference61 articles.
1. Prediction of bio-sequence modifications and the associations with diseases.;Ao;Briefin. Funct. Genom.
2. Prediction of antioxidant proteins using hybrid feature representation method and random forest.;Ao;Genomics
3. Knowledge-powered deep learning for word embedding. in Joint European conference on machine learning and knowledge discovery in databases;Bian;Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2014. Lecture Notes in Computer Science,2014
4. Computational and biological methods for gene therapy.;Cheng;Curr. Gene Ther.,2019
5. Computational Methods for identifying similar diseases.;Cheng;Mol. Ther. Nucleic Acids Res.,2019
Cited by
28 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献