Affiliation:
1. School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
Abstract
Most existing RGB-D salient object detection (SOD) methods either extract features from the two modalities in parallel or treat depth features as supplementary information, allowing only unidirectional interaction from the depth modality to the RGB modality in the encoder stage. These methods ignore the influence of low-quality depth maps, and there is still room for improvement in fusing RGB and depth features effectively. To address these problems, this paper proposes a Feature Interaction Network (FINet) that performs bi-directional interaction through a feature interaction module (FIM) in the encoder stage. The FIM consists of two parts: a depth enhancement module (DEM), which filters noise in the depth features through an attention mechanism, and a cross enhancement module (CEM), which enables effective interaction between RGB and depth features. In addition, this paper proposes a two-stage cross-modal fusion strategy: high-level fusion exploits high-level semantic information for coarse localization of salient regions, while low-level fusion makes full use of low-level detail through boundary fusion; the high-level and low-level cross-modal features are then progressively refined to obtain the final saliency prediction map. Extensive experiments show that the proposed model outperforms eight state-of-the-art models on five standard datasets.
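As a rough illustration of the encoder-stage interaction described above, the following is a minimal PyTorch sketch of the FIM pipeline. The module names (DEM, CEM, FIM) follow the abstract, but the internal design, channel attention in the DEM and multiplicative cross enhancement with residual connections in the CEM, is an assumption made for illustration, not the paper's actual implementation.

```python
# Minimal sketch of the feature interaction module (FIM) from the abstract.
# Internal designs (channel attention, multiplicative cross enhancement)
# are illustrative assumptions, not the paper's exact architecture.
import torch
import torch.nn as nn


class DEM(nn.Module):
    """Depth Enhancement Module: suppresses noisy depth channels via
    channel attention (an assumed attention design)."""
    def __init__(self, channels):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Sequential(
            nn.Conv2d(channels, channels // 4, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // 4, channels, 1),
            nn.Sigmoid(),
        )

    def forward(self, depth_feat):
        # Re-weight depth channels so unreliable ones contribute less.
        weights = self.fc(self.pool(depth_feat))
        return depth_feat * weights


class CEM(nn.Module):
    """Cross Enhancement Module: bi-directional interaction between RGB
    and enhanced depth features (assumed multiplicative + residual form)."""
    def __init__(self, channels):
        super().__init__()
        self.rgb_conv = nn.Conv2d(channels, channels, 3, padding=1)
        self.depth_conv = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, rgb_feat, depth_feat):
        # Each modality is enhanced by cues from the other, then added
        # back residually so the original signal is preserved.
        rgb_out = rgb_feat + self.rgb_conv(rgb_feat * depth_feat)
        depth_out = depth_feat + self.depth_conv(depth_feat * rgb_feat)
        return rgb_out, depth_out


class FIM(nn.Module):
    """Feature Interaction Module: DEM filtering followed by CEM interaction."""
    def __init__(self, channels):
        super().__init__()
        self.dem = DEM(channels)
        self.cem = CEM(channels)

    def forward(self, rgb_feat, depth_feat):
        return self.cem(rgb_feat, self.dem(depth_feat))


if __name__ == "__main__":
    fim = FIM(64)
    rgb = torch.randn(2, 64, 56, 56)
    depth = torch.randn(2, 64, 56, 56)
    rgb_out, depth_out = fim(rgb, depth)
    print(rgb_out.shape, depth_out.shape)  # both: torch.Size([2, 64, 56, 56])
```

In the paper's design, one such interaction block would sit at each encoder stage, with the two-stage cross-modal fusion applied afterward to the resulting multi-level features; that fusion strategy is not sketched here since the abstract does not specify its operations.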
Subject
Artificial Intelligence, General Engineering, Statistics and Probability
Cited by
2 articles.