Affiliation:
1. National Key Lab for Novel Software Technology, Nanjing University, China
Abstract
In this paper, we focus on the emerging field of web-based object detection, which has gained considerable attention due to its ability to utilize large amounts of web data for training, thus eliminating the need for labor-intensive manual annotations. However, the noisy and ever-evolving nature of web data poses challenges in preparing high-quality datasets for web-based object detection. To address these challenges, we propose a fully automatic dataset preparation method in this paper. Our proposed method incorporates a hierarchical clustering module that assigns multiple precise labels to each image. This module is based on our observation that web image data exhibits different distributions at varying granularities. Furthermore, an evolutionary relabeling module ensures the adaptability of both the prepared dataset and trained detection models to the ever-evolving web data. Extensive experiments demonstrate that our method outperforms other web-based methods, and achieves a comparable performance to those manually labeled benchmark datasets.
Publisher
Association for Computing Machinery (ACM)
Reference36 articles.
1. T/CESA 1307-2024. 2024. Information technology - Technical requirements of collaborative learning systems for heterogeneous computing.
2. T/CESA 1308-2024. 2024. Information technology - Data quality requirements for heterogeneous computing.
3. Hakan Bilen and Andrea Vedaldi. 2016. Weakly supervised deep detection networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2846–2854.
4. End-to-End Object Detection with Transformers
5. Webly Supervised Learning of Convolutional Networks