1. Deep Temporal Linear Encoding Networks;diba;Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2016
2. The kinetics human action video dataset;kay,2017
3. P-CNN: Pose-based CNN Features for Action Recognition;chéron;Proceedings of the IEEE International Conference on Computer Vision,2015
4. NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis
5. ActionVLAD: Learning spatio-temporal aggregation for action classification;girdhar;Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2017