A Decoupled Semantic–Detail Learning Network for Remote Sensing Object Detection in Complex Backgrounds

Published: 2023-07-24 Issue: 14 Volume: 12 Page: 3201
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Ruan Hao¹, Qian Wenbin¹, Zheng Zhihong¹, Peng Yingqiong¹

Affiliation:

1. School of Software, Jiangxi Agricultural University, Nanchang 330045, China

Abstract

Detecting multi-scale objects in complex backgrounds is a crucial challenge in remote sensing, chiefly because the localization and identification of objects against such backgrounds can be inaccurate. To address this issue, we propose a decoupled semantic–detail learning network (DSDL-Net). The approach comprises two components. First, we introduce a multi-receptive field feature fusion and detail mining (MRF-DM) module, which learns higher semantic-level representations by fusing multi-scale receptive fields and then uses multi-scale pooling to preserve detail texture information at different scales. Second, we present an adaptive cross-level semantic–detail fusion (CSDF) network that uses a feature pyramid to fuse detailed features extracted from the backbone network with high-level semantic features obtained from the topmost layer of the pyramid. The fusion is accomplished through two rounds of parallel global–local contextual feature extraction, with shared learning of global context information between the two rounds. Furthermore, to enhance both the fine-grained texture features conducive to object localization and the features conducive to object semantic recognition, we adopt and improve two enhancement modules with attention mechanisms, making them simpler and more lightweight. Experimental results demonstrate that our approach outperforms 12 benchmark models on three publicly available remote sensing datasets (DIOR, HRRSD, and RSOD) in average precision (AP) at small, medium, and large scales. On the DIOR dataset, our model achieved a 2.19% improvement in mAP@0.5 compared to the baseline model, with a parameter reduction of 14.07%.
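The abstract reports results as mAP@0.5 but does not spell out the evaluation protocol. As a minimal sketch, assuming the common detection convention in which a predicted box counts as a true positive when its intersection over union (IoU) with a ground-truth box is at least 0.5, the matching rule can be illustrated as follows. The function names `iou` and `is_true_positive` and the `(x1, y1, x2, y2)` box layout are illustrative assumptions, not taken from the paper, and the ranking by confidence and precision-recall integration needed for a full AP computation are omitted.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes.

    Boxes are (x1, y1, x2, y2) with x1 < x2 and y1 < y2 (assumed layout).
    """
    # Corners of the overlap rectangle, if the boxes overlap at all.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Clamp at zero so disjoint boxes yield an empty intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """Apply the assumed IoU >= threshold matching rule behind 'mAP@0.5'."""
    return iou(pred_box, gt_box) >= threshold
```

For example, a prediction of `(0, 0, 10, 10)` against a ground truth of `(0, 0, 10, 8)` overlaps on 80 of 100 units of union area and would be counted as a match under this rule, while two 2×2 boxes offset by one unit in each direction would not.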

Funder

National Natural Science Foundation of China

National Key Research and Development Program of China

Natural Science Foundation of Jiangxi Province

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/14/3201/pdf

References: 44 articles.

1. Remote sensing in urban planning: Contributions towards ecologically sound policies?; Wellmann; Landsc. Urban Plan., 2020

2. Remote sensing technology for mapping and monitoring land-cover and land-use change; Rogan; Prog. Plan., 2004

3. Applications of remote sensing and GIS in natural resource management; Kumar; J. Andaman Sci. Assoc., 2015

4. Advances in remote sensing for oil spill disaster management: State-of-the-art sensors technology for oil spill surveillance; Jha; Sensors, 2008

5. Wang, C.Y., Bochkovskiy, A., and Liao, H.Y.M. (2021, January 20–25). Scaled-yolov4: Scaling cross stage partial network. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.

Cited by 1 article.

1. FFCA-YOLO for Small Object Detection in Remote Sensing Images; IEEE Transactions on Geoscience and Remote Sensing; 2024