Semantic-Consistency-guided Learning on Deep Features for Unsupervised Salient Object Detection-Reference-Cited by-同舟云学术

Semantic-Consistency-guided Learning on Deep Features for Unsupervised Salient Object Detection

Published:2024-03-08 Issue:6 Volume:20 Page:1-23
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Zhang Ying Ying¹^ORCID,Zhang Shuo²^ORCID,Hui Ming¹^ORCID

Affiliation:

1. School of Physics and Electronic Engineering, Nanyang Normal University, Henan Engineering Research Center for Radio Frequency Front End and Antenna of Millimeter Wave Wireless Communication System, Nan Yang, China

2. School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China

Abstract

Unsupervised salient object detection is an important task in many real-world scenarios where pixel-wise label information is of scarce availability. Despite its significance, this problem remains rarely explored, with a few works that consider unsupervised salient object detection methods based on the fused graph from the sum fusion of multiple deep feature similarity matrices. However, these methods ignore the interrelation of the low-level feature similarity matrices and the high-level semantic similarity matrice, which degrades the quality of the fused graph. In this article, we propose a semantic-consistency-guided multi-graph fusion learning algorithm for unsupervised saliency detection, where the consistency and inconsistency between multiple low-level feature similarity matrices and the high-level semantic similarity matrice are explored to promote the robustness and quality of the fused graph. In the first stage, a semantic-consistency-guided multi-graph fusion learning method is proposed to exploit consistency and inconsistency of multiple low-level deep features and the high-level semantic feature. The semantic-consistency-guided similarity matrices are computed for preliminary saliency ranking. In the following saliency refinement stage, the semantic-enhanced similarity matrices are built by the cross diffusion to fuse the multiple low-level deep features and the high semantic deep feature. Based on the semantic-enhanced similarity matrices, the refinement saliency maps are calculated in a semantic-enhanced cellular automata manner. Furthermore, the final ensemble stage of the large margin semi-supervised classification views the preliminary ranking results and refinement results as features, adopts the large margin graphs for saliency ensemble. Extensive evaluations over four benchmark datasets show that the proposed unsupervised method performs favorably against the state-of-the-art approaches and is competitive with some supervised deep learning-based methods.

Funder

National Science Foundation of China

Henan Province University Science and Technology Innovation Talent Support Program

scientific and technological project in Henan Province of China

Cultivating Fund Project of the National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3640816

Reference62 articles.

1. Salient Object Detection: A Benchmark

2. Global Contrast Based Salient Region Detection

3. Lijuan Duan, Chunpeng Wu, Jun Miao, and Laiyun Qing. 2011. Visual saliency detection by spatially weighted dissimilarity. In Computer Vision and Pattern Recognition. 473–480.

4. Structure-Measure: A New Way to Evaluate Foreground Maps

5. Densely nested top-down flows for salient object detection