A semi-supervised machine learning framework for microRNA classification-Reference-Cited by-同舟云学术

A semi-supervised machine learning framework for microRNA classification

Published:2019-10 Issue:S1 Volume:13 Page:
ISSN:1479-7364
Container-title:Human Genomics
language:en
Short-container-title:Hum Genomics

Author:

Sheikh Hassani Mohsen,Green James R.

Abstract

Abstract Background MicroRNAs (miRNAs) are a family of short, non-coding RNAs that have been linked to critical cellular activities, most notably regulation of gene expression. The identification of miRNA is a cross-disciplinary approach that requires both computational identification methods and wet-lab validation experiments, making it a resource-intensive procedure. While numerous machine learning methods have been developed to increase classification accuracy and thus reduce validation costs, most methods use supervised learning and thus require large labeled training data sets, often not feasible for less-sequenced species. On the other hand, there is now an abundance of unlabeled RNA sequence data due to the emergence of high-throughput wet-lab experimental procedures, such as next-generation sequencing. Results This paper explores the application of semi-supervised machine learning for miRNA classification in order to maximize the utility of both labeled and unlabeled data. We here present the novel combination of two semi-supervised approaches: active learning and multi-view co-training. Results across six diverse species show that this multi-stage semi-supervised approach is able to improve classification performance using very small numbers of labeled instances, effectively leveraging the available unlabeled data. Conclusions The proposed semi-supervised miRNA classification pipeline holds the potential to identify novel miRNA with high recall and precision while requiring very small numbers of previously known miRNA. Such a method could be highly beneficial when studying miRNA in newly sequenced genomes of niche species with few known examples of miRNA.

Publisher

Springer Science and Business Media LLC

Subject

Drug Discovery,Genetics,Molecular Biology,Molecular Medicine

Link

http://link.springer.com/content/pdf/10.1186/s40246-019-0221-7.pdf

Reference52 articles.

1. Miranda K, Huynh T, Tay Y, Ang Y, Tam W, Thomson AM, et al. A pattern-based method for the identification of MicroRNA binding sites and their corresponding heteroduplexes. Cell. 2006;126:1203–17.

2. Iwasaki Y, Kiga K, Kayo H, Fukuda-Yuzawa Y, Weise J, Inada T, et al. Global microRNA elevation by inducible Exportin 5 regulates cell cycle entry. RNA. 2013;19:490–7.

3. La Torre A, Georgi S, Reh TA. Conserved microRNA pathway regulates developmental timing of retinal neurogenesis. Proc Natl Acad Sci. 2013;110:E2362–70.

4. Ren Z, Ambros VR. Caenorhabditis elegans microRNAs of the let-7 family act in innate immune response circuits and confer robust developmental timing against pathogen stress. Proc Natl Acad Sci. 2015;112:E2366–75.

5. Otto T, Candido SV, Pilarz MS, Sicinska E, Bronson RT, Bowden M, et al. Cell cycle-targeting microRNAs promote differentiation by enforcing cell-cycle exit. Proc Natl Acad Sci. 2017;114:10660–5.

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Species-specific microRNA discovery and target prediction in the soybean cyst nematode;Scientific Reports;2023-10-17

2. Data-driven decision-making for precision diagnosis of digestive diseases;BioMedical Engineering OnLine;2023-09-01

3. Deep learning and ensemble deep learning for circRNA-RBP interaction prediction in the last decade: A review;Engineering Applications of Artificial Intelligence;2023-08

4. Comprehensive study of semi-supervised learning for DNA methylation-based supervised classification of central nervous system tumors;BMC Bioinformatics;2022-06-08

5. Semisupervised Deep Learning Techniques for Predicting Acute Respiratory Distress Syndrome From Time-Series Clinical Data: Model Development and Validation Study;JMIR Formative Research;2021-09-14