Author:
Liang Yunyun, Zhang Shengli, Qiao Huijuan, Cheng Yinan
Abstract
<abstract>
<p>Enhancers are non-coding DNA fragments that can be bound by proteins to activate transcription of a gene, and hence play an important role in regulating gene expression. Identifying enhancers is challenging, and more complicated than identifying other genetic factors, because their positions vary and they are freely scattered. In addition, it has been shown that genetic variation in enhancers is related to human diseases. Therefore, identifying enhancers and their strength is of biological importance. In this paper, a novel model named iEnhancer-MFGBDT is developed to identify enhancers and their strength by fusing multiple features and using a gradient boosting decision tree (GBDT). The features include k-mer and reverse complement k-mer nucleotide composition based on the DNA sequence, and second-order moving average, normalized Moreau-Broto auto-cross correlation and Moran auto-cross correlation based on a dinucleotide physical structural property matrix. We then use GBDT first to select features and then to perform classification. On the benchmark dataset, the accuracies reach 78.67% for identifying enhancers and 66.04% for identifying their strength. Compared with other models, the results show that our model is a useful and effective tool for identifying enhancers and their strength. The datasets and source code are available at https://github.com/shengli0201/iEnhancer-MFGBDT1.</p>
</abstract>
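As an illustration of the first two sequence features named in the abstract, the following is a minimal sketch of k-mer and reverse complement k-mer composition. It assumes the commonly used definitions (normalized counts of overlapping k-mers, with each k-mer merged with its reverse complement); it is not taken from the authors' repository, and the function names `kmer_composition` and `rc_kmer_composition` are hypothetical.

```python
from itertools import product

# Translation table for complementing A<->T and C<->G.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(kmer):
    """Return the reverse complement of a DNA string."""
    return kmer.translate(COMPLEMENT)[::-1]


def kmer_composition(seq, k):
    """Normalized frequency of each of the 4**k k-mers in seq (overlapping windows)."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows containing ambiguous bases such as N
            counts[window] += 1
            total += 1
    return {m: (counts[m] / total if total else 0.0) for m in kmers}


def rc_kmer_composition(seq, k):
    """Merge each k-mer with its reverse complement (assumed definition).

    The merged frequency is stored under the lexicographically smaller of the pair.
    """
    merged = {}
    for kmer, freq in kmer_composition(seq, k).items():
        key = min(kmer, reverse_complement(kmer))
        merged[key] = merged.get(key, 0.0) + freq
    return merged
```

For k = 2 this gives 16 k-mer features and 10 reverse complement k-mer features, since four dinucleotides (AT, TA, CG, GC) are their own reverse complements. Feature vectors of this kind could then be concatenated with the correlation-based features before being passed to a GBDT.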
Publisher
American Institute of Mathematical Sciences (AIMS)
Subject
Applied Mathematics,Computational Mathematics,General Agricultural and Biological Sciences,Modeling and Simulation,General Medicine
Cited by
7 articles.