Abstract
AbstractIn the past, number of methods have been developed for predicting single label subcellular localization of mRNA in a cell. Only limited methods had been built to predict multi-label subcellular localization of mRNA. Most of the existing methods are slow and cannot be implemented at transcriptome scale. In this study, a fast and reliable method had been developed for predicting multi-label subcellular localization of mRNA that can be implemented at genome scale. Firstly, deep learning method based on convolutional neural network method have been developed using one-hot encoding and attained an average AUROC - 0.584 (0.543 – 0.605). Secondly, machine learning based methods have been developed using mRNA sequence composition, our XGBoost classifier achieved an average AUROC - 0.709 (0.668 - 0.732). In addition to alignment free methods, we also developed alignment-based methods using similarity and motif search techniques. Finally, a hybrid technique has been developed that combine XGBoost models and motif-based searching and achieved an average AUROC 0.742 (0.708 - 0.816). Our method – MRSLpred, developed in this study is complementary to the existing method. One of the major advantages of our method over existing methods is its speed, it can scan all mRNA of a transcriptome in few hours. A publicly accessible webserver and a standalone tool has been developed to facilitate researchers (Webserver:https://webs.iiitd.edu.in/raghava/mrslpred/).Key PointsPrediction of Subcellular localization of mRNAClassification of mRNA based on Motif and BLAST searchCombination of alignment based and alignment free techniquesA fast method for subcellular localization of mRNAA web server and standalone software
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献