On the Importance of Diversity When Training Deep Learning Segmentation Models with Error-Prone Pseudo-Labels-Reference-Cited by-同舟云学术

On the Importance of Diversity When Training Deep Learning Segmentation Models with Error-Prone Pseudo-Labels

Published:2024-06-13 Issue:12 Volume:14 Page:5156
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Yang Nana¹,Rongione Charles²,Jacquemart Anne-Laure²,Draye Xavier²,Vleeschouwer Christophe De¹^ORCID

Affiliation:

1. ICTEAM Institute, UCLouvain, 1348 Louvain-la-Neuve, Belgium

2. ELI Institute, UCLouvain, 1348 Louvain-la-Neuve, Belgium

Abstract

The key to training deep learning (DL) segmentation models lies in the collection of annotated data. The annotation process is, however, generally expensive in human resources. Our paper leverages deep or traditional machine learning methods trained on a small set of manually labeled data to automatically generate pseudo-labels on large datasets, which are then used to train so-called data-reinforced deep learning models. The relevance of the approach is demonstrated in two applicative scenarios that are distinct both in terms of task and pseudo-label generation procedures, enlarging the scope of the outcomes of our study. Our experiments reveal that (i) data reinforcement helps, even with error-prone pseudo-labels, (ii) convolutional neural networks have the capability to regularize their training with respect to labeling errors, and (iii) there is an advantage to increasing diversity when generating the pseudo-labels, either by enriching the manual annotation through accurate annotation of singular samples, or by considering soft pseudo-labels per sample when prior information is available about their certainty.

Funder

China Scholarship Council

Belgian F.N.R.S

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/12/5156/pdf

Reference54 articles.

1. Visualizing the effects of predictor variables in black box supervised learning models;Apley;J. R. Stat. Soc. Ser. B Stat. Methodol.,2020

2. Supervised learning: Classification;Castelli;Encycl. Bioinform. Comput. Biol.,2018

3. Application of supervised learning to validation of damage detection;Sarmadi;Arch. Appl. Mech.,2021

4. Zhou, Z.H. (2021). Semi-supervised learning. Machine Learning, Springer.

5. Ouali, Y., Hudelot, C., and Tami, M. (2020). An overview of deep semi-supervised learning. arXiv.