1. SIMONYAN K ZISSERMAN A. 2022. Two-stream convolutional networks for action recognition in videos [EB/OL].
2. Video Action Transformer Network
3. CARREIRA J, ZISSERMAN A. 2017. Quo vadis, actionrecognition? A new model and the kinetics dataset [C] //Proceedings of 2017 IEEE Conference on Computer Visionand Pattern Recognition. Honolulu, USA: IEEE Press.
4. Feichtenhofer, Christoph 2019. “SlowFast Networks for Video Recognition.”IEEE/CVF International Conference on Computer Vision (ICCV).
5. WOO S, PARK J, LEE J Y, 2018. CBAM: convolutional block attention module [C] //Proceedings of International Conference on Computer Vision. Washington D. C., USA: IEEE Press.