1. Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. 4171--4186. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. 4171--4186.
2. Victor Escorcia , Mattia Soldan , Josef Sivic , Bernard Ghanem , and Bryan C . Russell . 2019 . Temporal Localization of Moments in Video Collections with Natural Language. CoRR abs/1907.12763 (2019). Victor Escorcia, Mattia Soldan, Josef Sivic, Bernard Ghanem, and Bryan C. Russell. 2019. Temporal Localization of Moments in Video Collections with Natural Language. CoRR abs/1907.12763 (2019).
3. Christoph Feichtenhofer Haoqi Fan Jitendra Malik and Kaiming He. 2019. Slow- Fast Networks for Video Recognition. In ICCV. 6201--6210. Christoph Feichtenhofer Haoqi Fan Jitendra Malik and Kaiming He. 2019. Slow- Fast Networks for Video Recognition. In ICCV. 6201--6210.
4. Jiyang Gao , Chen Sun , Zhenheng Yang , and Ram Nevatia . 2017 . TALL: Temporal Activity Localization via Language Query. In ICCV. 5277--5285. Jiyang Gao, Chen Sun, Zhenheng Yang, and Ram Nevatia. 2017. TALL: Temporal Activity Localization via Language Query. In ICCV. 5277--5285.
5. Junyu Gao and Changsheng Xu. 2021. Fast Video Moment Retrieval. In ICCV. 1503--1512. Junyu Gao and Changsheng Xu. 2021. Fast Video Moment Retrieval. In ICCV. 1503--1512.