Author:
Khalid Razia,Naveed Hammad,Khalid Zoya
Abstract
AbstractLong non-coding RNAs (lncRNAs), which were once considered as transcriptional noise, are now in the limelight of current research. LncRNAs play a major role in regulating various biological processes such as imprinting, cell differentiation, and splicing. The mutations of lncRNAs are involved in various complex diseases. Identifying lncRNA-disease associations has gained a lot of attention as predicting it efficiently will lead towards better disease treatment. In this study, we have developed a machine learning model that predicts disease-related lncRNAs by combining sequence and structure-based features. The features were trained on SVM and Random Forest classifiers. We have compared our method with the state-of-the-art and obtained the highest F1 score of 76% on SVM classifier. Moreover, this study has overcome two serious limitations of the reported method which are lack of redundancy checking and implementation of oversampling for balancing the positive and negative class. Our method has achieved improved performance among machine learning models reported for lncRNA-disease associations. Combining multiple features together specifically lncRNAs sequence mutation has a significant contribution to the disease related lncRNA prediction.
Publisher
Springer Science and Business Media LLC
Reference30 articles.
1. Wapinski, O. & Chang, H. Y. Long noncoding RNAs and human disease. Trends Cell Biol. 21, 354–361 (2011).
2. Liu, S. et al. PredLnc-GFStack: A global sequence feature based on a stacked ensemble learning method for predicting lncRNAs from transcripts. Genes 10, 672 (2019).
3. Zeng, C. & Hamada, M. Identifying sequence features that drive ribosomal association for lncRNAs. BMC Genomics 10, 41–49 (2018).
4. Chen, X. et al. Computational models for lncrna function prediction and functional similarity calculation. Brief. Funct. Genomics 18, 58–82 (2019).
5. Chen, X., Yan, C., Zhang, X. & You, Z. H. Long non-coding RNAs and complex diseases: From experimental results to computational models. Brief. Bioinform. 18, 558–576 (2016).
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献