HVConv: Horizontal and Vertical Convolution for Remote Sensing Object Detection
-
Published:2024-05-24
Issue:11
Volume:16
Page:1880
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Chen Jinhui1ORCID, Lin Qifeng1ORCID, Huang Haibin1ORCID, Yu Yuanlong1, Zhu Daoye1ORCID, Fu Gang2
Affiliation:
1. College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China 2. Department of Computing, The Hong Kong Polytechnic University, Hong Kong 999077, China
Abstract
Generally, the interesting objects in aerial images are completely different from objects in nature, and the remote sensing objects in particular tend to be more distinctive in aspect ratio. The existing convolutional networks have equal aspect ratios of the receptive fields, which leads to receptive fields either containing non-relevant information or being unable to fully cover the entire object. To this end, we propose Horizontal and Vertical Convolution, which is a plug-and-play module to address different aspect ratio problems. In our method, we introduce horizontal convolution and vertical convolution to expand the receptive fields in the horizontal and vertical directions, respectively, to reduce redundant receptive fields, so that remote sensing objects with different aspect ratios can achieve better receptive fields coverage, thereby achieving more accurate feature representation. In addition, we design an attention module to dynamically aggregate these two sub-modules to achieve more accurate feature coverage. Extensive experimental results on the DOTA and HRSC2016 datasets show that our HVConv achieves accuracy improvements in diverse detection architectures and obtains SOTA accuracy (mAP score of 77.60% with DOTA single-scale training and mAP score of 81.07% with DOTA multi-scale training). Various ablation studies were conducted as well, which is enough to verify the effectiveness of our model.
Funder
National Natural Science Foundation of China Natural Science Foundation of Fujian Province Research Program for Young and Middle-Aged Teachers of Fujian Province
Reference49 articles.
1. Align Deep Features for Oriented Object Detection;Han;IEEE Trans. Geosci. Remote Sens.,2022 2. Yang, X., Yan, J., Feng, Z., and He, T. (2021, January 4–7). R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. Proceedings of the 35th AAAI Conference on Artificial Intelligence, Virtual. 3. Pan, X., Ren, Y., Sheng, K., Dong, W., Yuan, H., Guo, X., Ma, C., and Xu, C. (2020, January 14–19). Dynamic refinement network for oriented and densely packed object detection. Proceedings of the 2020 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Virtual. 4. MEDNet: Multiexpert Detection Network With Unsupervised Clustering of Training Samples;Lin;IEEE Trans. Geosci. Remote Sens.,2022 5. Feng, L.Q., Luo Jun, L., Yuan Long, Y., and Fu, G. (November, January 29). A Multiple Prediction Mechanisms Ensemble for Complex Remote Sensing Scenes. Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada.
|
|