A machine learning technique for identifying DNA enhancer regions utilizing CIS-regulatory element patterns-Reference-Cited by-同舟云学术

A machine learning technique for identifying DNA enhancer regions utilizing CIS-regulatory element patterns

Published:2022-09-07 Issue:1 Volume:12 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Butt Ahmad Hassan,Alkhalifah Tamim,Alturise Fahad,Khan Yaser Daanial

Abstract

AbstractEnhancers regulate gene expression, by playing a crucial role in the synthesis of RNAs and proteins. They do not directly encode proteins or RNA molecules. In order to control gene expression, it is important to predict enhancers and their potency. Given their distance from the target gene, lack of common motifs, and tissue/cell specificity, enhancer regions are thought to be difficult to predict in DNA sequences. Recently, a number of bioinformatics tools were created to distinguish enhancers from other regulatory components and to pinpoint their advantages. However, because the quality of its prediction method needs to be improved, its practical application value must also be improved. Based on nucleotide composition and statistical moment-based features, the current study suggests a novel method for identifying enhancers and non-enhancers and evaluating their strength. The proposed study outperformed state-of-the-art techniques using fivefold and tenfold cross-validation in terms of accuracy. The accuracy from the current study results in 86.5% and 72.3% in enhancer site and its strength prediction respectively. The results of the suggested methodology point to the potential for more efficient and successful outcomes when statistical moment-based features are used. The current study's source code is available to the research community at https://github.com/csbioinfopk/enpred.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-022-19099-3.pdf

Reference99 articles.

1. Erwin, G. D. et al. Integrating diverse datasets improves developmental enhancer prediction. PLoS Comput. Biol. 10(6), e1003677–e1003677. https://doi.org/10.1371/journal.pcbi.1003677 (2014).

2. Visel, A., Rubin, E. M. & Pennacchio, L. A. Genomic views of distant-acting enhancers. Nature 461(7261), 199–205. https://doi.org/10.1038/nature08451 (2009).

3. Sakabe, N. J., Savic, D. & Nobrega, M. A. Transcriptional enhancers in development and disease. Genome Biol. 13(1), 238 (2012).

4. Heintzman, N. D. & Ren, B. Finding distal regulatory elements in the human genome. Curr. Opin. Genet. Dev. 19(6), 541–549. https://doi.org/10.1016/j.gde.2009.09.006 (2009).

5. Blackwood, E. M. & Kadonaga, J. T. Going the distance: A current view of enhancer action. Science 281, 60 (1998).

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. m5c-iDeep: 5-Methylcytosine sites identification through deep learning;Methods;2024-10

2. Using a K-mer Based Approach with Machine Learning Classifiers for Enhancer Identification and Classification;2024-08-28

3. An intelligent model for prediction of abiotic stress-responsive microRNAs in plants using statistical moments based features and ensemble approaches;Methods;2024-08

4. Identification of 6-methyladenosine sites using novel feature encoding methods and ensemble models;Scientific Reports;2024-04-08

5. Gene replacement therapies for inherited disorders of neurotransmission: Current progress in succinic semialdehyde dehydrogenase deficiency;Journal of Inherited Metabolic Disease;2024-04-06