Salient object detection in egocentric videos

Published: 2024-03-13 Issue: 8 Volume: 18 Page: 2028-2037
ISSN: 1751-9659
Container-title: IET Image Processing
Language: en
Short-container-title: IET Image Processing

Author:

Zhang Hao¹, Liang Haoran¹, Zhao Xing¹, Liu Jian¹, Liang Ronghua¹

Affiliation:

1. Zhejiang University of Technology, Hangzhou, China

Abstract

In video salient object detection (VSOD), most research has centered on third‐person perspective videos. This focus overlooks the requirements of first‐person tasks such as autonomous driving and robot vision. To bridge this gap, a novel dataset and a camera‐based VSOD model, CaMSD, both designed specifically for egocentric videos, are introduced. First, the SalEgo dataset, comprising 17,400 fully annotated frames for video salient object detection, is presented. Second, a computational model that incorporates a camera movement module is proposed; the module is designed to emulate the patterns observed when humans view videos. Additionally, a saliency enhancement module based on the Squeeze and Excitation Block is incorporated so that, when attention switches between salient objects, a single salient object is segmented precisely rather than two objects simultaneously. Experimental results show that the approach outperforms other state‐of‐the‐art methods in egocentric video salient object detection tasks. The dataset and code can be found at https://github.com/hzhang1999/SalEgo.
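The abstract names the Squeeze and Excitation Block as the basis of the saliency enhancement module without describing its internals. As a rough illustration only, the generic squeeze-and-excitation idea (pool each channel to a scalar, pass the scalars through a small bottleneck, and rescale each channel by the resulting gate) might be sketched as below. This is not the CaMSD module; the function name `squeeze_excite`, the weights `w1` and `w2`, and the toy feature map are all assumptions made for the example.

```python
import math

def squeeze_excite(features, w1, w2):
    """Reweight channels of a feature map given as C channels of H x W rows.

    w1 and w2 are hypothetical stand-ins for learned bottleneck weights.
    """
    # Squeeze: global average pooling, one scalar per channel.
    z = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in features]
    # Excitation: bottleneck layer with ReLU, then sigmoid gates in (0, 1).
    hidden = [max(0.0, sum(w * v for w, v in zip(row, z))) for row in w1]
    gates = [1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
             for row in w2]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [[[g * v for v in row] for row in ch]
              for g, ch in zip(gates, features)]
    return scaled, gates

# Two 2x2 channels; all values are arbitrary and purely illustrative.
features = [[[1.0, 1.0], [1.0, 1.0]],
            [[2.0, 0.0], [0.0, 0.0]]]
w1 = [[1.0, -1.0]]      # reduce 2 channels to 1 hidden unit
w2 = [[2.0], [-2.0]]    # expand back to 2 channel gates
scaled, gates = squeeze_excite(features, w1, w2)
```

With these toy weights the first channel receives a larger gate than the second, so one channel is emphasized while the other is suppressed. That is the general mechanism by which channel reweighting could favor one salient object over another, under the assumptions stated above.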

Funder

National Natural Science Foundation of China

Publisher

Institution of Engineering and Technology (IET)

References: 58 articles.

1. Segmentation of Moving Objects by Long Term Video Analysis

2. Li, F., Kim, T., Humayun, A., Tsai, D., Rehg, J.M.: Video segmentation by tracking many figure‐ground segments. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2192–2199. IEEE, Piscataway (2013)

3. Perazzi, F., Pont‐Tuset, J., McWilliams, B., Van Gool, L., Gross, M., Sorkine‐Hornung, A.: A benchmark dataset and evaluation methodology for video object segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 724–732. IEEE, Piscataway (2016)

4. Consistent Video Saliency Using Local Gradient Flow Optimization and Global Refinement

5. Spatiotemporal Saliency Detection for Video Sequences Based on Random Walk With Restart