Addressing Noisy Pixels in Weakly Supervised Semantic Segmentation with Weights Assigned-Reference-Cited by-同舟云学术

Addressing Noisy Pixels in Weakly Supervised Semantic Segmentation with Weights Assigned

Published:2024-08-15 Issue:16 Volume:12 Page:2520
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Qian Feng¹^ORCID,Yang Juan²,Tang Sipeng³,Chen Gao⁴^ORCID,Yan Jingwen²

Affiliation:

1. Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China

2. College of Engineering, Shantou University, Shantou 515063, China

3. China Mobile Communications Group Guangdong Co., Ltd. Shantou Branch, Shantou 515041, China

4. School of Telecommunications Engineering and Intelligentization, Dongguan University of Technology, Dongguan 523808, China

Abstract

Weakly supervised semantic segmentation (WSSS) aims to segment objects without a heavy burden of dense annotations. Pseudo-masks serve as supervisory information for training segmentation models, which is crucial to the performance of segmentation models. However, the generated pseudo-masks contain significant noisy labels, which leads to poor performance of the segmentation models trained on these pseudo-masks. Few studies address this issue, as these noisy labels remain inevitable even after the pseudo-masks are improved. In this paper, we propose an uncertainty-weight transform module to mitigate the impact of noisy labels on model performance. It is noteworthy that our approach is not aimed at eliminating noisy labels but rather enhancing the robustness of the model to noisy labels. The proposed method adopts a frequency-based approach to estimate pixel uncertainty. Moreover, the uncertainty of pixels is transformed into loss weights through a set of well-designed functions. After dynamically assigning weights, the model allocates attention to each pixel in a significantly differentiated manner. Meanwhile, the impact of noisy labels on model performance is weakened. Experiments validate the effectiveness of the proposed method, achieving state-of-the-art results of 69.3% on PASCAL VOC 2012 and 39.3% on MS COCO 2014, respectively.

Funder

State key laboratory major special projects of Jilin Province Science and Technology Development Plan

Guangdong Provincial University Innovation Team Project

Guangdong Province Natural Science Foundation

Songshan Lake Sci-tech Commissoner Program

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-7390/12/16/2520/pdf

Reference57 articles.

1. Kong, L., Ren, J., Pan, L., and Liu, Z. (2023, January 17–24). Lasermix for semi-supervised lidar semantic segmentation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.

2. Sepico: Semantic-guided pixel contrast for domain adaptive semantic segmentation;Xie;IEEE Trans. Pattern Anal. Mach. Intell.,2023

3. A survey on label-efficient deep image segmentation: Bridging the gap between weak supervision and dense prediction;Shen;IEEE Trans. Pattern Anal. Mach. Intell.,2023

4. Lai, X., Tian, Z., Jiang, L., Liu, S., Zhao, H., Wang, L., and Jia, J. (2021, January 20–25). Semi-supervised semantic segmentation with directional context-aware consistency. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.

5. Hu, R., Dollár, P., He, K., Darrell, T., and Girshick, R. (2018, January 18–23). Learning to segment every thing. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.