Affiliation:
1. School of Software, Jiangxi Agricultural University, Nanchang 330045, China
Abstract
Detecting multi-scale objects in complex backgrounds is a crucial challenge in remote sensing. The main challenge is that the localization and identification of objects in complex backgrounds can be inaccurate. To address this issue, a decoupled semantic–detail learning network (DSDL-Net) was proposed. Our proposed approach comprises two components. Firstly, we introduce a multi-receptive field feature fusion and detail mining (MRF-DM) module, which learns higher semantic-level representations by fusing multi-scale receptive fields. Subsequently, it uses multi-scale pooling to preserve detail texture information at different scales. Secondly, we present an adaptive cross-level semantic–detail fusion (CSDF) network that leverages a feature pyramid with fusion between detailed features extracted from the backbone network and high-level semantic features obtained from the topmost layer of the pyramid. The fusion is accomplished through two rounds of parallel global–local contextual feature extraction, with shared learning for global context information between the two rounds. Furthermore, to effectively enhance fine-grained texture features conducive to object localization and features conducive to object semantic recognition, we adopt and improve two enhancement modules with attention mechanisms, making them simpler and more lightweight. Our experimental results demonstrate that our approach outperforms 12 benchmark models on three publicly available remote sensing datasets (DIOR, HRRSD, and RSOD) regarding average precision (AP) at small, medium, and large scales. On the DIOR dataset, our model achieved a 2.19% improvement in mAP@0.5 compared to the baseline model, with a parameter reduction of 14.07%.
Funder
National Natural Science Foundation of China
National Key Research and Development Program of China
Natural Science Foundation of Jiangxi Province
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference44 articles.
1. Remote sensing in urban planning: Contributions towards ecologically sound policies?;Wellmann;Landsc. Urban Plan.,2020
2. Remote sensing technology for mapping and monitoring land-cover and land-use change;Rogan;Prog. Plan.,2004
3. Applications of remote sensing and GIS in natural resource management;Kumar;J. Andaman Sci. Assoc.,2015
4. Advances in remote sensing for oil spill disaster management: State-of-the-art sensors technology for oil spill surveillance;Jha;Sensors,2008
5. Wang, C.Y., Bochkovskiy, A., and Liao, H.Y.M. (2021, January 20–25). Scaled-yolov4: Scaling cross stage partial network. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献