Less is More: Decoupled High-Semantic Encoding for Action Recognition

Author:

Zhang Chun¹, Ren Keyan¹, Bian Qingyun¹, Shi Yu²

Affiliation:

1. Faculty of Information Technology, Beijing University of Technology, China

2. Faculty of Information Technology, Beijing University of Technology, China

Publisher:

ACM

References: 37 articles.

1. Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, and Cordelia Schmid. 2021. ViViT: A Video Vision Transformer. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. IEEE, 6816–6826. https://doi.org/10.1109/ICCV48922.2021.00676

2. Abdelhadi Azzouni and Guy Pujolle. 2017. A Long Short-Term Memory Recurrent Neural Network Framework for Network Traffic Matrix Prediction. CoRR abs/1705.05690 (2017). arXiv:1705.05690 http://arxiv.org/abs/1705.05690

3. Hangbo Bao, Li Dong, Songhao Piao, and Furu Wei. 2022. BEiT: BERT Pre-Training of Image Transformers. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net. https://openreview.net/forum?id=p-BhZSz59o4

4. Gedas Bertasius, Heng Wang, and Lorenzo Torresani. 2021. Is Space-Time Attention All You Need for Video Understanding? In Proceedings of the International Conference on Machine Learning (ICML).

5. Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. 2020. End-to-End Object Detection with Transformers. CoRR abs/2005.12872 (2020). arXiv:2005.12872 https://arxiv.org/abs/2005.12872

Cited by 1 article.

1. Geospatial Knowledge Hypercube;Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems;2023-11-13
