MFDANet: Multi-Scale Feature Dual-Stream Aggregation Network for Salient Object Detection

Author:

Ge Bin1ORCID,Pei Jiajia1,Xia Chenxing12,Wu Taolin1

Affiliation:

1. College of Computer Science and Engineering, Anhui University of Science and Technology, Huainan 232001, China

2. Institute of Energy, Hefei Comprehensive National Science Center, Hefei 230031, China

Abstract

With the development of deep learning, significant improvements and optimizations have been made in salient object detection. However, many salient object detection methods have limitations, such as insufficient context information extraction, limited interaction modes for different level features, and potential information loss due to a single interaction mode. In order to solve the aforementioned issues, we proposed a dual-stream aggregation network based on multi-scale features, which consists of two main modules, namely a residual context information extraction (RCIE) module and a dense dual-stream aggregation (DDA) module. Firstly, the RCIE module was designed to fully extract context information by connecting features from different receptive fields via residual connections, where convolutional groups composed of asymmetric convolution and dilated convolution are used to extract features from different receptive fields. Secondly, the DDA module aimed to enhance the relationships between different level features by leveraging dense connections to obtain high-quality feature information. Finally, two interaction modes were used for dual-stream aggregation to generate saliency maps. Extensive experiments on 5 benchmark datasets show that the proposed model performs favorably against 15 state-of-the-art methods.

Funder

National Key R&D Program of China

Natural Science Research Project of Colleges and Universities in Anhui Province

Anhui Postdoctoral Science Foundation

National Natural Science Foundation of China

Natural Science Foundation of Anhui Province

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Reference35 articles.

1. Rutishauser, U., Walther, D., Koch, C., and Perona, P. (July, January 27). Is Bottom-Up Attention Useful for Object Recognition?. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, Washington, DC, USA.

2. Video Co-Saliency Guided Co-Segmentation;Wang;IEEE Trans. Circuits Syst. Video Technol.,2018

3. Kragic, D., Bicchi, A., and Luca, A.D. (2016, January 16–21). Environment exploration for object-based visual saliency learning. Proceedings of the 2016 IEEE International Conference on Robotics and Automation, ICRA 2016, Stockholm, Sweden.

4. He, J., Feng, J., Liu, X., Cheng, T., Lin, T., Chung, H., and Chang, S. (2012, January 16–21). Mobile product search with Bag of Hash Bits and boundary reranking. Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.

5. Global Contrast Based Salient Region Detection;Cheng;IEEE Trans. Pattern Anal. Mach. Intell.,2015

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3