1. Saliency guided local and global descriptors for effective action recognition;Abdulmunem;Computational Visual Media,2016
2. Sami Abu-El-Haija, Nisarg Kothari, Joonseok Lee, Paul Natsev, George Toderici, Balakrishnan Varadarajan, and Sudheendra Vijayanarasimhan. Youtube-8M: A large-scale video classification benchmark. arXiv preprint arXiv:1609.08675, 2016.
3. Look, listen and learn;Arandjelovic,2017
4. Yunlong Bian, Chuang Gan, Xiao Liu, Fu Li, Xiang Long, Yandong Li, Heng Qi, Jie Zhou, Shilei Wen, and Yuanqing Lin. Revisiting the effectiveness of off-the-shelf temporal modeling approaches for large-scale video classification. arXiv preprint arXiv:1708.03805, 2017.
5. Quo vadis, action recognition? a new model and the Kinetics dataset;Carreira,2017