Author:
Tian Fengrui,Fan Jiawei,Yu Xie,Du Shaoyi,Song Meina,Zhao Yu
Publisher
Springer Nature Switzerland
Reference47 articles.
1. Ahsan, U., Madhok, R., Essa, I.: Video Jigsaw: unsupervised learning of spatiotemporal context for video action recognition. In: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 179–189 (2019). https://doi.org/10.1109/WACV.2019.00025
2. Alwassel, H., Mahajan, D., Korbar, B., Torresani, L., Ghanem, B., Tran, D.: Self-supervised learning by cross-modal audio-video clustering. In: NeurIPS (2020)
3. Benaim, S., et al.: SpeedNet: learning the speediness in videos. In: CVPR, pp. 9922–9931 (2020)
4. Biondi, F.N., Alvarez, I.J., Jeong, K.A.: Human-vehicle cooperation in automated driving: a multidisciplinary review and appraisal. Int. J. Hum.-Comput. Interact. 35, 932–946 (2019)
5. Carreira, J., Zisserman, A.: Quo Vadis, action recognition? A new model and the kinetics dataset. In: CVPR, pp. 6299–6308 (2017)