HVConv: Horizontal and Vertical Convolution for Remote Sensing Object Detection-Reference-Cited by-同舟云学术

HVConv: Horizontal and Vertical Convolution for Remote Sensing Object Detection

Published:2024-05-24 Issue:11 Volume:16 Page:1880
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Chen Jinhui¹^ORCID,Lin Qifeng¹^ORCID,Huang Haibin¹^ORCID,Yu Yuanlong¹,Zhu Daoye¹^ORCID,Fu Gang²

Affiliation:

1. College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China

2. Department of Computing, The Hong Kong Polytechnic University, Hong Kong 999077, China

Abstract

Generally, the interesting objects in aerial images are completely different from objects in nature, and the remote sensing objects in particular tend to be more distinctive in aspect ratio. The existing convolutional networks have equal aspect ratios of the receptive fields, which leads to receptive fields either containing non-relevant information or being unable to fully cover the entire object. To this end, we propose Horizontal and Vertical Convolution, which is a plug-and-play module to address different aspect ratio problems. In our method, we introduce horizontal convolution and vertical convolution to expand the receptive fields in the horizontal and vertical directions, respectively, to reduce redundant receptive fields, so that remote sensing objects with different aspect ratios can achieve better receptive fields coverage, thereby achieving more accurate feature representation. In addition, we design an attention module to dynamically aggregate these two sub-modules to achieve more accurate feature coverage. Extensive experimental results on the DOTA and HRSC2016 datasets show that our HVConv achieves accuracy improvements in diverse detection architectures and obtains SOTA accuracy (mAP score of 77.60% with DOTA single-scale training and mAP score of 81.07% with DOTA multi-scale training). Various ablation studies were conducted as well, which is enough to verify the effectiveness of our model.

Funder

National Natural Science Foundation of China

Natural Science Foundation of Fujian Province

Research Program for Young and Middle-Aged Teachers of Fujian Province

Publisher

MDPI AG

Link

https://www.mdpi.com/2072-4292/16/11/1880/pdf

Reference49 articles.

1. Align Deep Features for Oriented Object Detection;Han;IEEE Trans. Geosci. Remote Sens.,2022

2. Yang, X., Yan, J., Feng, Z., and He, T. (2021, January 4–7). R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. Proceedings of the 35th AAAI Conference on Artificial Intelligence, Virtual.

3. Pan, X., Ren, Y., Sheng, K., Dong, W., Yuan, H., Guo, X., Ma, C., and Xu, C. (2020, January 14–19). Dynamic refinement network for oriented and densely packed object detection. Proceedings of the 2020 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Virtual.

4. MEDNet: Multiexpert Detection Network With Unsupervised Clustering of Training Samples;Lin;IEEE Trans. Geosci. Remote Sens.,2022

5. Feng, L.Q., Luo Jun, L., Yuan Long, Y., and Fu, G. (November, January 29). A Multiple Prediction Mechanisms Ensemble for Complex Remote Sensing Scenes. Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada.