A Prediction Approach Based on Self-Training and Deep Learning for Biological Data-Reference-Cited by-同舟云学术

A Prediction Approach Based on Self-Training and Deep Learning for Biological Data

Published:2023-12-29 Issue: Volume: Page:78-93
ISSN:
Container-title:Research Anthology on Bioinformatics, Genomics, and Computational Biology
language:ng
Short-container-title:

Author:

Boufenara Mohamed Nadjib¹,Boufaida Mahmoud¹,Berkane Mohamed Lamine¹

Affiliation:

1. LIRE Laboratory, Abdelhamid MEHRI, Constantine 2 University, Algeria

Abstract

With the exponential growth of biological data, labeling this kind of data becomes difficult and costly. Although unlabeled data are comparatively more plentiful than labeled ones, most supervised learning methods are not designed to use unlabeled data. Semi-supervised learning methods are motivated by the availability of large unlabeled datasets rather than a small amount of labeled examples. However, incorporating unlabeled data into learning does not guarantee an improvement in classification performance. This paper introduces an approach based on a model of semi-supervised learning, which is the self-training with a deep learning algorithm to predict missing classes from labeled and unlabeled data. In order to assess the performance of the proposed approach, two datasets are used with four performance measures: precision, recall, F-measure, and area under the ROC curve (AUC).

Publisher

IGI Global

Reference34 articles.

1. Agarap, A. F. (2018). Deep learning using rectified linear units (relu). arXiv preprint arXiv:1803.08375

2. Allen, N. E., Sudlow, C., Peakman, T., & Collins, R. (2014). UK biobank data: come and get it. Academic Press.

3. Importance of nonsynonymous OCA 2 variants in human eye color prediction

4. Deep learning for computational biology

5. 23andMe and the FDA