Less is More: Decoupled High-Semantic Encoding for Action Recognition

Published: 2023-06-12
Container-title: Proceedings of the 2023 ACM International Conference on Multimedia Retrieval

Author:

Zhang Chun¹, Ren Keyan¹, Bian Qingyun¹, Shi Yu²

Affiliation:

1. Faculty of Information Technology, Beijing University of Technology, China

2. Faculty of Information Technology, Beijing University of Technology, China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3591106.3592233

References: 37 articles.

1. Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, and Cordelia Schmid. 2021. ViViT: A Video Vision Transformer. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. IEEE, 6816–6826. https://doi.org/10.1109/ICCV48922.2021.00676

2. Abdelhadi Azzouni and Guy Pujolle. 2017. A Long Short-Term Memory Recurrent Neural Network Framework for Network Traffic Matrix Prediction. CoRR abs/1705.05690 (2017). arXiv:1705.05690 http://arxiv.org/abs/1705.05690

3. Hangbo Bao, Li Dong, Songhao Piao, and Furu Wei. 2022. BEiT: BERT Pre-Training of Image Transformers. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net. https://openreview.net/forum?id=p-BhZSz59o4

4. Gedas Bertasius, Heng Wang, and Lorenzo Torresani. 2021. Is Space-Time Attention All You Need for Video Understanding? In Proceedings of the International Conference on Machine Learning (ICML).

5. Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. 2020. End-to-End Object Detection with Transformers. CoRR abs/2005.12872 (2020). arXiv:2005.12872 https://arxiv.org/abs/2005.12872

Cited by 1 article.

1. Geospatial Knowledge Hypercube. Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems. 2023-11-13.