Affiliation:
1. School of Innovation and Entrepreneurship, Dalian University of Technology, Dalian 116024, China
2. School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
3. School of Information and Communication Engineering, Dalian University of Technology, Dalian 116024, China
Abstract
Detecting salient objects in complicated scenarios is a challenging problem. Except for semantic features from the RGB image, spatial information from the depth image also provides sufficient cues about the object. Therefore, it is crucial to rationally integrate RGB and depth features for the RGB-D salient object detection task. Most existing RGB-D saliency detectors modulate RGB semantic features with absolution depth values. However, they ignore the appearance contrast and structure knowledge indicated by relative depth values between pixels. In this work, we propose a depth-induced network (DIN) for RGB-D salient object detection, to take full advantage of both absolute and relative depth information, and further, enforce the in-depth fusion of the RGB-D cross-modalities. Specifically, an absolute depth-induced module (ADIM) is proposed, to hierarchically integrate absolute depth values and RGB features, to allow the interaction between the appearance and structural information in the encoding stage. A relative depth-induced module (RDIM) is designed, to capture detailed saliency cues, by exploring contrastive and structural information from relative depth values in the decoding stage. By combining the ADIM and RDIM, we can accurately locate salient objects with clear boundaries, even from complex scenes. The proposed DIN is a lightweight network, and the model size is much smaller than that of state-of-the-art algorithms. Extensive experiments on six challenging benchmarks, show that our method outperforms most existing RGB-D salient object detection models.
Funder
Ministry of Science and Technology of China
National Natural Science Foundation of China
Fundamental Research Funds for the Central Universities
National Key R&D Program of China
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference69 articles.
1. Region-Based Saliency Detection and Its Application in Object Recognition;Ren;IEEE Trans. Circuits Syst. Video Technol.,2014
2. Rapid Biologically-Inspired Scene Classification Using Features Shared with Visual Attention;Siagian;IEEE Trans. Pattern Anal. Mach. Intell.,2007
3. Biologically Inspired Object Tracking Using Center-surround Saliency Mechanisms;Mahadevan;IEEE Trans. Pattern Anal. Mach. Intell.,2013
4. Borji, A., Frintrop, S., Sihite, D.N., and Itti, L. (2012, January 16–21). Adaptive object tracking by learning background context. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Providence, RI, USA.
5. Computer vision detection of foreign objects in coal processing using attention CNN;Zhang;Eng. Appl. Artif. Intell.,2021
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献