Abstract
AbstractPixel-wise image segmentation is key for many Computer Vision applications. The training of deep neural networks for this task has expensive pixel-level annotation requirements, thus, motivating a growing interest on synthetic data to provide unlimited data and its annotations. In this paper, we focus on the generation and application of synthetic data as representative training corpuses for semantic segmentation of urban scenes. First, we propose a synthetic data generation protocol, which identifies key features affecting performance and provides datasets with variable complexity. Second, we adapt two popular weakly supervised domain adaptation approaches (combined training, fine-tuning) to employ synthetic and real data. Moreover, we analyze several backbone models, real/synthetic datasets and their proportions when combined. Third, we propose a new curriculum learning strategy to employ several synthetic and real datasets. Our major findings suggest the high performance impact of pace and order of synthetic and real data presentation, achieving state of the art results for well-known models. The results by training with the proposed dataset outperform popular alternatives, thus demonstrating the effectiveness of the proposed protocol. Our code and dataset are available at http://www-vpu.eps.uam.es/publications/WSDA_semantic/
Funder
Universidad Autónoma de Madrid
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Hardware and Architecture,Media Technology,Software
Reference83 articles.
1. Balaji Y, Chellappa R, Feizi S (2019) Normalized wasserstein for mixture distributions with applications in adversarial learning and domain adaptation. In: Proc IEEE conf Comput Vis (ICCV), pp 6499–6507
2. Bengio Y, Louradour J, Collobert R, Weston J (2009) Curriculum learning. In: Proceedings of the 26th annual international conference on machine learning. ICML ’09. Association for computing machinery, pp 41–48
3. Biasetton M, Michieli U, Agresti G, Zanuttigh P (2019) Unsupervised domain adaptation for semantic segmentation of urban scenes. In: Proc IEEE conf comput vis pattern recognit (CVPR) workshops, vol 2019-june
4. Bousmalis K, Silberman N, Research G, York N, Dohan D, Erhan D, Brain G, Francisco S, Krishnan D (2019) Unsupervised pixel-level domain adaptation with generative adversarial networks. In: Proc IEEE conf Comput Vis Pattern recognit. (CVPR)
5. Chen S, Jia X, He J, Shi Y, Liu J (2021) Semi-supervised domain adaptation based on dual-level domain mixing for semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 11018–11027
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献