Video object segmentation via couple streams and feature memory-Reference-Cited by-同舟云学术

Video object segmentation via couple streams and feature memory

Published:2024-04-17 Issue:9 Volume:18 Page:2257-2272
ISSN:1751-9659
Container-title:IET Image Processing
language:en
Short-container-title:IET Image Processing

Author:

Liang Yun¹^ORCID,Xiao Xinjie¹^ORCID,Qiu Shaojian¹,Zhang Yuqing²^ORCID,Su Zhuo³

Affiliation:

1. College of Mathematics and Informatics South China Agricultural University Guangzhou China

2. School of Control Science and Engineering Beijing University of Technology Beijing China

3. School of Control Science and Engineering Sun Yat‐sen University Guangzhou China

Abstract

AbstractIn recent years, most video segmentation methods use deep CNN to process the input image, but they did not fully mine the rich intermediate predictions in spatio‐temporal space. And, the segmentation challenges such as occlusion, severe deformation and illumination have not been well solved so far. To alleviate these problems, this paper focuses on constructing multi module network structures that represent multi semantics and proposes a video object segmentation network via coupled‐stream architecture with feature memory mechanism. This network first extracts high‐level semantic features, edge features, long‐term and short‐term stable depth features of the target, and then decode them into the segmentation mask of target. In addition, negative skeleton inhibition and frame interpolation are used to prevent the interference of similar objects and motion blur, respectively. The method has a low GPU memory usage, regardless of the number of object in video. And performs 86.5%and 62.4% in J&F measure on DAVIS 2016 and DAVIS 2017 validation set, without fine‐tuning and online training.

Funder

National Natural Science Foundation of China

Science and Technology Planning Project of Guangdong Province

Publisher

Institution of Engineering and Technology (IET)

Reference62 articles.

1. Superpixel Labeling Priors and MRF for Aerial Video Segmentation

2. Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks

3. Higher-order potentials for video object segmentation in bilateral space

4. Ding M. Wang Z. Zhou B. Shi J. Lu Z. Luo P.:Every frame counts: joint learning of video segmentation and optical flow. In:Proceedings of the AAAI Conference on Artificial Intelligence vol.34 pp.10713–10720.AAAI Press Menlo Park CA(2020)

5. Cheng J. Tsai Y.H. Wang S. Yang M.H.:Segflow: Joint learning for video object segmentation and optical flow. In:Proceedings of the IEEE International Conference on Computer Vision pp.686–695.IEEE Piscataway(2017)