Affiliation:
1. Qingshuihe Campus, University of Electronic Science and Technology of China, Chengdu 611731, China
Abstract
The teacher–student framework is widely used for cross-domain object detection, but it suffers from two problems. First, large distribution discrepancies between domains cause severe performance drops. Second, samples that deviate from the overall distributions of both domains can seriously mislead the model. To solve these problems, we propose a style-guided adversarial teacher (SGAT) method for domain adaptation. At the domain level, we generate target-like images from source images to narrow the gap between domains. At the sample level, we denoise samples by estimating the probability density ratio of the ‘target-style’ and target distributions, which filters out unrelated samples and highlights related ones, yielding reliable samples. With these reliable samples, we learn domain-invariant features through teacher–student mutual learning and adversarial learning. Extensive experiments verify the effectiveness of our method. In particular, we achieve 52.9% mAP on Clipart1k and 42.7% on Comic2k, which are 6.4% and 5.0% higher than the compared baselines.
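The sample-level step described in the abstract can be illustrated with a small sketch. The abstract does not say how the density ratio is estimated, so the following assumes one common approach: a binary domain classifier is trained to separate ‘target-style’ images from real target images, and its logit is converted into a ratio with Bayes' rule. The direction of the ratio (target over target-style), the clipping threshold, and the function names `density_ratio_from_logit` and `sample_weights` are hypothetical choices made for illustration, not details taken from the paper.

```python
import math


def density_ratio_from_logit(logit, n_style, n_target):
    """Estimate p_target(x) / p_style(x) from a domain-classifier logit.

    Assumption: logit = log(P(target | x) / P(style | x)) from a classifier
    trained on n_style 'target-style' images and n_target target images.
    By Bayes' rule the density ratio is the posterior odds multiplied by the
    inverse of the prior odds, n_style / n_target.
    """
    return math.exp(logit) * (n_style / n_target)


def sample_weights(logits, n_style, n_target, clip=5.0):
    """Turn per-sample logits into loss weights with mean 1.

    Ratios are clipped (hypothetical threshold) so that a few extreme
    samples cannot dominate, then normalized by their mean. Samples that
    look unlike the target distribution get weights below 1.
    """
    ratios = [
        min(density_ratio_from_logit(z, n_style, n_target), clip)
        for z in logits
    ]
    mean = sum(ratios) / len(ratios)
    return [r / mean for r in ratios]


if __name__ == "__main__":
    # Three 'target-style' samples: target-like, neutral, and an outlier.
    logits = [1.0, 0.0, -3.0]
    for z, w in zip(logits, sample_weights(logits, n_style=100, n_target=100)):
        print(f"logit={z:+.1f}  weight={w:.3f}")
```

In a training loop, such weights would multiply each sample's detection loss, so outliers contribute little while target-like samples are emphasized.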
Funder
National Natural Science Foundation of China
Sichuan Science and Technology Program