TranSDet: Toward Effective Transfer Learning for Small-Object Detection
-
Published:2023-07-12
Issue:14
Volume:15
Page:3525
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Xu Xinkai123ORCID, Zhang Hailan1ORCID, Ma Yan23ORCID, Liu Kang1ORCID, Bao Hong23, Qian Xu1
Affiliation:
1. School of Artificial Intelligence, China University of Mining and Technology-Beijing, Beijing 100083, China 2. Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, China 3. College of Robotics, Beijing Union University, Beijing 100027, China
Abstract
Small-object detection is a challenging task in computer vision due to the limited training samples and low-quality images. Transfer learning, which transfers the knowledge learned from a large dataset to a small dataset, is a popular method for improving performance on limited data. However, we empirically find that due to the dataset discrepancy, directly transferring the model trained on a general object dataset to small-object datasets obtains inferior performance. In this paper, we propose TranSDet, a novel approach for effective transfer learning for small-object detection. Our method adapts a model trained on a general dataset to a small-object-friendly model by augmenting the training images with diverse smaller resolutions. A dynamic resolution adaptation scheme is employed to ensure consistent performance on various sizes of objects using meta-learning. Additionally, the proposed method introduces two network components, an FPN with shifted feature aggregation and an anchor relation module, which are compatible with transfer learning and effectively improve small-object detection performance. Extensive experiments on the TT100K, BUUISE-MO-Lite, and COCO datasets demonstrate that TranSDet achieves significant improvements compared to existing methods. For example, on the TT100K dataset, TranSDet outperforms the state-of-the-art method by 8.0% in terms of the mean average precision (mAP) for small-object detection. On the BUUISE-MO-Lite dataset, TranSDet improves the detection accuracy of RetinaNet and YOLOv3 by 32.2% and 12.8%, respectively.
Funder
Key project of the National Nature Science Foundation of China
Subject
General Earth and Planetary Sciences
Reference63 articles.
1. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks;Ren;IEEE Transactions on Pattern Analysis and Machine Intelligence,2017 2. Lin, T.Y., Goyal, P., Girshick, R., He, K., and Dollar, P. (2017, January 22–29). Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy. 3. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016). Computer Vision—ECCV 2016, Springer. 4. Shivappriya, S.N., Priyadarsini, M.J.P., Stateczny, A., Puttamadappa, C., and Parameshachari, B.D. (2021). Cascade Object Detection and Remote Sensing Object Detection Method Based on Trainable Activation Function. Remote Sens., 13. 5. Concealed Object Detection;Fan;IEEE Transactions on Pattern Analysis and Machine Intelligence,2022
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|