MFDANet: Multi-Scale Feature Dual-Stream Aggregation Network for Salient Object Detection-Reference-Cited by-同舟云学术

MFDANet: Multi-Scale Feature Dual-Stream Aggregation Network for Salient Object Detection

Published:2023-06-29 Issue:13 Volume:12 Page:2880
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Ge Bin¹^ORCID,Pei Jiajia¹,Xia Chenxing¹²,Wu Taolin¹

Affiliation:

1. College of Computer Science and Engineering, Anhui University of Science and Technology, Huainan 232001, China

2. Institute of Energy, Hefei Comprehensive National Science Center, Hefei 230031, China

Abstract

With the development of deep learning, significant improvements and optimizations have been made in salient object detection. However, many salient object detection methods have limitations, such as insufficient context information extraction, limited interaction modes for different level features, and potential information loss due to a single interaction mode. In order to solve the aforementioned issues, we proposed a dual-stream aggregation network based on multi-scale features, which consists of two main modules, namely a residual context information extraction (RCIE) module and a dense dual-stream aggregation (DDA) module. Firstly, the RCIE module was designed to fully extract context information by connecting features from different receptive fields via residual connections, where convolutional groups composed of asymmetric convolution and dilated convolution are used to extract features from different receptive fields. Secondly, the DDA module aimed to enhance the relationships between different level features by leveraging dense connections to obtain high-quality feature information. Finally, two interaction modes were used for dual-stream aggregation to generate saliency maps. Extensive experiments on 5 benchmark datasets show that the proposed model performs favorably against 15 state-of-the-art methods.

Funder

National Key R&D Program of China

Natural Science Research Project of Colleges and Universities in Anhui Province

Anhui Postdoctoral Science Foundation

National Natural Science Foundation of China

Natural Science Foundation of Anhui Province

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/13/2880/pdf

Reference35 articles.

1. Rutishauser, U., Walther, D., Koch, C., and Perona, P. (July, January 27). Is Bottom-Up Attention Useful for Object Recognition?. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, Washington, DC, USA.

2. Video Co-Saliency Guided Co-Segmentation;Wang;IEEE Trans. Circuits Syst. Video Technol.,2018

3. Kragic, D., Bicchi, A., and Luca, A.D. (2016, January 16–21). Environment exploration for object-based visual saliency learning. Proceedings of the 2016 IEEE International Conference on Robotics and Automation, ICRA 2016, Stockholm, Sweden.

4. He, J., Feng, J., Liu, X., Cheng, T., Lin, T., Chung, H., and Chang, S. (2012, January 16–21). Mobile product search with Bag of Hash Bits and boundary reranking. Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.

5. Global Contrast Based Salient Region Detection;Cheng;IEEE Trans. Pattern Anal. Mach. Intell.,2015