Affiliation:
1. Department of Computer Science, Xiamen University, Xiamen, China
2. School of Information Science and Engineering, Hunan University, Changsha, China
3. School of Software, Shandong University, Jinan, China
Abstract
Abstract
Motivation
Identification of enhancer–promoter interactions (EPIs) is of great significance to human development. However, experimental methods to identify EPIs cost too much in terms of time, manpower and money. Therefore, more and more research efforts are focused on developing computational methods to solve this problem. Unfortunately, most existing computational methods require a variety of genomic data, which are not always available, especially for a new cell line. Therefore, it limits the large-scale practical application of methods. As an alternative, computational methods using sequences only have great genome-scale application prospects.
Results
In this article, we propose a new deep learning method, namely EPIVAN, that enables predicting long-range EPIs using only genomic sequences. To explore the key sequential characteristics, we first use pre-trained DNA vectors to encode enhancers and promoters; afterwards, we use one-dimensional convolution and gated recurrent unit to extract local and global features; lastly, attention mechanism is used to boost the contribution of key features, further improving the performance of EPIVAN. Benchmarking comparisons on six cell lines show that EPIVAN performs better than state-of-the-art predictors. Moreover, we build a general model, which has transfer ability and can be used to predict EPIs in various cell lines.
Availability and implementation
The source code and data are available at: https://github.com/hzy95/EPIVAN.
Funder
National Natural Science Foundation of China
Project of marine economic innovation and development in Xiamen
Natural Science Foundation of the Higher Education Institutions of Fujian Province
Natural Science Foundation of Fujian Province
Natural Science Foundation of Tianjin City
National Key R&D Program of China
Publisher
Oxford University Press (OUP)
Subject
Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability
Cited by
142 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献