Specificity-preserving RGB-D saliency detection-Reference-Cited by-同舟云学术

Specificity-preserving RGB-D saliency detection

Published:2023-01-03 Issue:2 Volume:9 Page:297-317
ISSN:2096-0433
Container-title:Computational Visual Media
language:en
Short-container-title:Comp. Visual Media

Author:

Zhou Tao,Fan Deng-Ping,Chen Geng,Zhou Yi,Fu Huazhu

Abstract

AbstractSalient object detection (SOD) in RGB and depth images has attracted increasing research interest. Existing RGB-D SOD models usually adopt fusion strategies to learn a shared representation from RGB and depth modalities, while few methods explicitly consider how to preserve modality-specific characteristics. In this study, we propose a novel framework, the specificity-preserving network (SPNet), which improves SOD performance by exploring both the shared information and modality-specific properties. Specifically, we use two modality-specific networks and a shared learning network to generate individual and shared saliency prediction maps. To effectively fuse cross-modal features in the shared learning network, we propose a cross-enhanced integration module (CIM) and propagate the fused feature to the next layer to integrate cross-level information. Moreover, to capture rich complementary multi-modal information to boost SOD performance, we use a multi-modal feature aggregation (MFA) module to integrate the modality-specific features from each individual decoder into the shared decoder. By using skip connections between encoder and decoder layers, hierarchical features can be fully combined. Extensive experiments demonstrate that our SPNet outperforms cutting-edge approaches on six popular RGB-D SOD and three camouflaged object detection benchmarks. The project is publicly available at https://github.com/taozh2017/SPNet.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition

Link

https://link.springer.com/content/pdf/10.1007/s41095-022-0268-6.pdf

Reference105 articles.

1. Lecture Notes in Computer Science;H Peng,2014

2. Zhu, J.-Y.; Wu, J.-J.; Xu, Y.; Chang, E.; Tu, Z. W. Unsupervised object class discovery via saliency-guided multiple class learning. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 37, No. 4, 862–875, 2015.

3. Rapantzikos, K.; Avrithis, Y.; Kollias, S. Dense saliency-based spatiotemporal feature points for action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1454–1461, 2009.

4. Lecture Notes in Computer Science;W Shimoda,2016

5. Wang, W. G.; Shen, J. B.; Yang, R. G.; Porikli, F. Saliency-aware video object segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 40, No. 1, 20–33, 2018.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Cross-Modal Adaptive Interaction Network for RGB-D Saliency Detection;Applied Sciences;2024-08-23

2. Message from the Best Paper Award Committee;Computational Visual Media;2024-05-14

3. RGB-D Visual Saliency Detection Algorithm Based on Information Guided and Multimodal Feature Fusion;IEEE Access;2024

4. Cross-modal hierarchical interaction network for RGB-D salient object detection;Pattern Recognition;2023-04

5. CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection;IEEE Transactions on Image Processing;2023