RGB-D Salient Object Detection via 3D Convolutional Neural Networks-Reference-Cited by-同舟云学术

RGB-D Salient Object Detection via 3D Convolutional Neural Networks

Published:2021-05-18 Issue:2 Volume:35 Page:1063-1071
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Chen Qian,Liu Ze,Zhang Yi,Fu Keren,Zhao Qijun,Du Hongwei

Abstract

RGB-D salient object detection (SOD) recently has attracted increasing research interest and many deep learning methods based on encoder-decoder architectures have emerged. However, most existing RGB-D SOD models conduct feature fusion either in the single encoder or the decoder stage, which hardly guarantees sufficient cross-modal fusion ability. In this paper, we make the first attempt in addressing RGB-D SOD through 3D convolutional neural networks. The proposed model, named RD3D, aims at pre-fusion in the encoder stage and in-depth fusion in the decoder stage to effectively promote the full integration of RGB and depth streams. Specifically, RD3D first conducts pre-fusion across RGB and depth modalities through an inflated 3D encoder, and later provides in-depth feature fusion by designing a 3D decoder equipped with rich back-projection paths (RBPP) for leveraging the extensive aggregation ability of 3D convolutions. With such a progressive fusion strategy involving both the encoder and decoder, effective and thorough interaction between the two modalities can be exploited and boost the detection accuracy. Extensive experiments on six widely used benchmark datasets demonstrate that RD3D performs favorably against 14 state-of-the-art RGB-D SOD approaches in terms of four key evaluation metrics. Our code will be made publicly available: https://github.com/PPOLYpubki/RD3D.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 80 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Progressive expansion for semi-supervised bi-modal salient object detection;Pattern Recognition;2025-01

2. Incomplete RGB-D salient object detection: Conceal, correlate and fuse;Pattern Recognition;2024-11

3. Degradation-removed multiscale fusion for low-light salient object detection;Pattern Recognition;2024-11

4. A foreground-context dual-guided network for light-field salient object detection;Signal Processing: Image Communication;2024-10

5. Transformer-based cross-modality interaction guidance network for RGB-T salient object detection;Neurocomputing;2024-10