Segmentation of Discriminative Patches in Human Activity Video-Reference-Cited by-同舟云学术

Segmentation of Discriminative Patches in Human Activity Video

Published:2015-08-24 Issue:1 Volume:12 Page:1-19
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Zhang Bo¹,Conci Nicola¹,De Natale Francesco G.B.¹

Affiliation:

1. University of Trento, Italy

Abstract

In this article, we present a novel approach to segment discriminative patches in human activity videos. First, we adopt the spatio-temporal interest points (STIPs) to represent significant motion patterns in the video sequence. Then, nonnegative sparse coding is exploited to generate a sparse representation of each STIP descriptor. We construct the feature vector for each video by applying a two-stage sum-pooling and l 2 -normalization operation. After training a multi-class classifier through the error-correcting code SVM, the discriminative portion of each video is determined as the patch that has the highest confidence while also being correctly classified according to the video category. Experimental results show that the video patches extracted by our method are more separable, while preserving the perceptually relevant portion of each activity.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/2750780

Reference44 articles.

1. Video-Based Human Behavior Understanding: A Survey

2. Sparse Modeling of Human Actions from Motion Imagery

3. Tracking video objects in cluttered background

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Planar Reconstruction of Indoor Scenes from Sparse Views and Relative Camera Poses;Remote Sensing;2024-04-30

2. Data-driven enabled approaches for criteria-based video summarization: a comprehensive survey, taxonomy, and future directions;Multimedia Tools and Applications;2023-03-02

3. IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment;2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2022-06

4. Single-stage Instance Segmentation;ACM Transactions on Multimedia Computing, Communications, and Applications;2020-09-04

5. Evaluation of the integrated multi-satellite retrievals for global precipitation measurement over the Tibetan Plateau;Journal of Mountain Science;2019-07