Abstract
ABSTRACTMicroRNAs are special RNA sequences containing 22 nucleotides and are capable of regulating almost 60% of highly complex mammalian transcriptome. Presently, there exists very limited approaches capable of visualizing miRNA locations inside cell to reveal the hidden pathways, and mechanisms behind miRNA functionality, transport, and biogenesis. State-of-the-art miRNA sub-cellular location prediction MIRLocatar approach makes use of sequence to sequence model along with pre-train k-mer embeddings. Existing pre-train k-mer embedding generation methodologies focus on the extraction of semantics of k-mers. In RNA sequences, rather than semantics, positional information of nucleotides is more important because distinct positions of four basic nucleotides actually define the functionality of RNA molecules. Considering the dynamicity and importance of nucleotides positions, instead of learning representation on the basis of k-mers semantics, we propose a novel kmerRP2vec feature representation approach that fuses positional information of k-mers to randomly initialized neural k-mer embeddings. Effectiveness of proposed feature representation approach is evaluated with two deep learning based convolutional neural network CNN and recurrent neural network RNN methodologies using 8 evaluation measures. Experimental results on a public benchmark miRNAsubloc dataset prove that proposed kmerRP2vec approach along with a simple CNN model outperforms state-of-the-art MirLocator approach with a significant margin of 18% and 19% in terms of precision and recall.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献