Motion-Attentive Transition for Zero-Shot Video Object Segmentation

Published:2020-04-03 Issue:07 Volume:34 Page:13066-13073
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Zhou Tianfei,Wang Shunzhou,Zhou Yi,Yao Yazhou,Li Jianwu,Shao Ling

Abstract

In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representations. An asymmetric attention block, called Motion-Attentive Transition (MAT), is designed within a two-stream encoder; it transforms appearance features into motion-attentive representations at each convolutional stage. In this way, the encoder becomes deeply interleaved, allowing for close hierarchical interactions between object motion and appearance. This is superior to the typical two-stream architecture, which treats motion and appearance separately in each stream and often suffers from overfitting to appearance information. Additionally, a bridge network is proposed to obtain a compact, discriminative and scale-sensitive representation of multi-level encoder features, which is then fed into a decoder to produce segmentation results. Extensive experiments on three challenging public benchmarks (i.e., DAVIS-16, FBMS and YouTube-Objects) show that our model achieves compelling performance against the state of the art. Code is available at: https://github.com/tfzhou/MATNet.
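
The asymmetric idea in the abstract, where motion features modulate appearance features but not the other way round, can be illustrated with a toy sketch. This is not the MAT block from the paper or the linked repository: the function name `motion_attentive_gate`, the sigmoid gate, and the residual reweighting are all assumptions chosen for brevity, and features are plain Python lists rather than convolutional feature maps.

```python
import math


def sigmoid(x):
    """Logistic function, used here as a hypothetical attention gate."""
    return 1.0 / (1.0 + math.exp(-x))


def motion_attentive_gate(appearance, motion):
    """Reweight appearance features with a gate computed from motion only.

    appearance, motion: lists of per-location feature vectors.
    Returns new appearance features; the motion features are never
    modified, which is the (simplified) asymmetry being illustrated.
    """
    if len(appearance) != len(motion):
        raise ValueError("appearance and motion must cover the same locations")
    gated = []
    for a_vec, m_vec in zip(appearance, motion):
        # Assumed gate: squash the mean motion activation into (0, 1).
        gate = sigmoid(sum(m_vec) / len(m_vec))
        # Residual form keeps the original appearance signal and
        # amplifies it where motion is strong.
        gated.append([a * (1.0 + gate) for a in a_vec])
    return gated


# Two locations with identical appearance but different motion strength.
appearance = [[1.0, 2.0], [1.0, 2.0]]
motion = [[0.0, 0.0], [10.0, 10.0]]
result = motion_attentive_gate(appearance, motion)
```

In this sketch the location with strong motion ends up with larger appearance activations than the static one. In an interleaved encoder of the kind the abstract describes, a step like this would be applied after every convolutional stage rather than once at the end.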

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 95 articles.

1. Meta-reinforcement learning for active visual tracking about space non-cooperative object;Multimedia Tools and Applications;2024-08-31

2. Motion perception-driven multimodal self-supervised video object segmentation;The Visual Computer;2024-08-09

3. Self-supervised rigid object 3-D motion estimation from monocular video;Measurement;2024-08

4. Key points trajectory and multi-level depth distinction based refinement for video mirror and glass segmentation;Multimedia Tools and Applications;2024-06-20

5. A Foundation Model for General Moving Object Segmentation in Medical Images;2024 IEEE International Symposium on Biomedical Imaging (ISBI);2024-05-27