A method for balancing a multi-labeled biomedical dataset-Reference-Cited by-同舟云学术

A method for balancing a multi-labeled biomedical dataset

Published:2022-03-14 Issue:2 Volume:29 Page:209-225
ISSN:1069-2509
Container-title:Integrated Computer-Aided Engineering
language:
Short-container-title:ICA

Author:

Mukhin A.V.¹,Kilbas I.A.¹,Paringer R.A.¹²,Ilyasova N. Yu.¹²,Kupriyanov A.V.¹²

Affiliation:

1. Samara National Research University, Moskovskoye Shosse, 34, Samara, Russia

2. IPSI RAS – Branch of the FSRC “Crystallography and Photonics” RAS, Samara, Russia

Abstract

In this paper, we propose a data balancing method for multi-label biomedical data. The method can be applied in the case of semantic segmentation problems for balancing the corresponding image data. The proposed method performs oversampling of instances of minority classes in a way that increases the frequencies of appearance (a ratio of number of samples, containing this class, over the total number of samples in the dataset) of minority classes in the data, thereby reducing the class imbalance. The effectiveness of the proposed method is shown experimentally by applying it to two highly unbalanced biomedical image datasets. A convolutional neural network (CNN) was trained on several versions of those datasets: one balanced with the proposed method, another balanced with manual oversampling and an unbalanced version. The results of the experiments validate the effectiveness of the proposed method, proving that it allows the influence of class imbalance on the learning algorithm to be reduced, thus improving its original classification results for most of the classes. Apart from biomedical image data, the proposed method was applied to several common multi-label datasets. Inherently, the proposed method does not make any assumptions about the underlying structure of the data to be balanced; therefore, it can be applied to all types of data (vectors, images, etc.) that can be described in a multi-label framework. It also can be used in conjunction with any learning algorithm that is suitable for multi-label data. To illustrate its wider applicability, a series of experiments was conducted using seven common multi-label datasets. An experimental comparison to existing multi-label data balancing approaches is provided, as well. The experimental results show that the proposed method presents a competitive alternative to existing approaches.

Publisher

IOS Press

Subject

Artificial Intelligence,Computational Theory and Mathematics,Computer Science Applications,Theoretical Computer Science,Software

Reference40 articles.

1. Acrophobia quantified by EEG based on CNN incorporating Granger causality;Hu;International Journal of Neural Systems.,2021

2. Semantic segmentation of satellite images of airports using convolutional neural networks;Vadim;Computer Optics.,2020

3. Reachability analysis of neural masses and seizure control based on combination convolutional neural network;Ma;International Journal of Neural Systems.,2020

4. Automatic seizure detection based on S-Transform and deep convolutional neural network;Liu;International Journal of Neural Systems.,2020

5. Reachability analysis of neural masses and seizure control based on combination convolutional neural network;Ma;International Journal of Neural Systems.,2020

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep Learning in Environmental Toxicology: Current Progress and Open Challenges;ACS ES&T Water;2023-06-20

2. Image Processing Systems Institute of the RAS: Responses to Current Challenges;2023 IX International Conference on Information Technology and Nanotechnology (ITNT);2023-04-17

3. Application of Artificial Intelligence in Ophthalmology for Coagulate Map Formation to Carry Out Laser Eye Treatment;Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges;2023

4. A system for biomedical audio signal processing based on high performance computing techniques;Integrated Computer-Aided Engineering;2022-11-24

5. Ontology-based Meta AutoML;Integrated Computer-Aided Engineering;2022-08-26