Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach-Reference-Cited by-同舟云学术

Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach

Published:2023-11-21 Issue:5 Volume:36 Page:2597-2616
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Mosqueira-Rey Eduardo^ORCID,Hernández-Pereira Elena,Bobes-Bascarán José,Alonso-Ríos David,Pérez-Sánchez Alberto,Fernández-Leal Ángel,Moret-Bonillo Vicente,Vidal-Ínsua Yolanda,Vázquez-Rivera Francisca

Abstract

AbstractAny machine learning (ML) model is highly dependent on the data it uses for learning, and this is even more important in the case of deep learning models. The problem is a data bottleneck, i.e. the difficulty in obtaining an adequate number of cases and quality data. Another issue is improving the learning process, which can be done by actively introducing experts into the learning loop, in what is known as human-in-the-loop (HITL) ML. We describe an ML model based on a neural network in which HITL techniques were used to resolve the data bottleneck problem for the treatment of pancreatic cancer. We first augmented the dataset using synthetic cases created by a generative adversarial network. We then launched an active learning (AL) process involving human experts as oracles to label both new cases and cases by the network found to be suspect. This AL process was carried out simultaneously with an interactive ML process in which feedback was obtained from humans in order to develop better synthetic cases for each iteration of training. We discuss the challenges involved in including humans in the learning process, especially in relation to human–computer interaction, which is acquiring great importance in building ML models and can condition the success of a HITL approach. This paper also discusses the methodological approach adopted to address these challenges.

Funder

Agencia Estatal de Investigación

Xunta de Galicia

CITIC

Universidade da Coruña

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s00521-023-09197-2.pdf

Reference94 articles.