Multi-Scale Locality-Constrained Spatiotemporal Coding for Local Feature Based Human Action Recognition-Reference-Cited by-同舟云学术

Multi-Scale Locality-Constrained Spatiotemporal Coding for Local Feature Based Human Action Recognition

Published:2013 Issue: Volume:2013 Page:1-11
ISSN:1537-744X
Container-title:The Scientific World Journal
language:en
Short-container-title:The Scientific World Journal

Author:

Wang Bin¹^ORCID,Liu Yu¹,Wang Wei¹,Xu Wei¹,Zhang Maojun¹

Affiliation:

1. College of Information System and Manage, National University of Defense Technology, 109 Deya Road, Changsha, Hunan 410073, China

Abstract

We propose a Multiscale Locality-Constrained Spatiotemporal Coding (MLSC) method to improve the traditional bag of features (BoF) algorithm which ignores the spatiotemporal relationship of local features for human action recognition in video. To model this spatiotemporal relationship, MLSC involves the spatiotemporal position of local feature into feature coding processing. It projects local features into a sub space-time-volume (sub-STV) and encodes them with a locality-constrained linear coding. A group of sub-STV features obtained from one video with MLSC and max-pooling are used to classify this video. In classification stage, the Locality-Constrained Group Sparse Representation (LGSR) is adopted to utilize the intrinsic group information of these sub-STV features. The experimental results on KTH, Weizmann, and UCF sports datasets show that our method achieves better performance than the competing local spatiotemporal feature-based human action recognition methods.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

General Environmental Science,General Biochemistry, Genetics and Molecular Biology,General Medicine

Link

http://downloads.hindawi.com/journals/tswj/2013/405645.pdf

Reference26 articles.

1. Memory-Based Multiagent Coevolution Modeling for Robust Moving Object Tracking

2. Structured learning of local features for human action classification and localization

3. A sequence-action recognition applying state machine for user interface