1. Where to Focus on for Human Action Recognition?
2. Attention is all you need;vaswani;Advances in neural information processing systems,2017
3. Cross-lingual language model pretraining;conneau;Advances in neural information processing systems,2019
4. A Closer Look at Spatiotemporal Convolutions for Action Recognition
5. An image is worth 16x16 words: Transformers for image recognition at scale;dosovitskiy;International Conference on Learning Representations,2020