1. Real-time video super-resolution with spatio-temporal networks and motion compensation;Caballero,2017
2. Quo vadis, action recognition? A new model and the Kinetics dataset;Carreira,2017
3. Rethinking the Faster R-CNN architecture for temporal action localization;Chao,2018
4. Relation distillation networks for video object detection;Deng,2019
5. J. Devlin, M.W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, 2018. arXiv preprint arXiv:1810.04805.