Visual Knowledge Graph for Human Action Reasoning in Videos-Reference-Cited by-同舟云学术

Visual Knowledge Graph for Human Action Reasoning in Videos

Published:2022-10-10 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 30th ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Ma Yue¹,Wang Yali²,Wu Yue³,Lyu Ziyu³,Chen Siran⁴,Li Xiu⁵,Qiao Yu⁶

Affiliation:

1. Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences & Tsinghua University, ShenZhen, China

2. Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences & Shenzhen Institute of Artificial Intelligence and Robotics for Society, ShenZhen, China

3. Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, ShenZhen, China

4. Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences & University of Chinese Academy of Science, ShenZhen, China

5. Tsinghua University, ShenZhen, China

6. Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences &Shanghai AI Laboratory, ShenZhen, China

Funder

National Natural Science Foun- dation of China

the Shanghai Committee of Science and Technology

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3503161.3548257

Reference66 articles.

1. Sami Abu-El-Haija , Nisarg Kothari , Joonseok Lee , Paul Natsev , George Toderici , Balakrishnan Varadarajan , and Sudheendra Vijayanarasimhan . 2016. Youtube-8m: A large-scale video classification benchmark. arXiv preprint arXiv:1609.08675 ( 2016 ). Sami Abu-El-Haija, Nisarg Kothari, Joonseok Lee, Paul Natsev, George Toderici, Balakrishnan Varadarajan, and Sudheendra Vijayanarasimhan. 2016. Youtube-8m: A large-scale video classification benchmark. arXiv preprint arXiv:1609.08675 (2016).

2. ViViT: A Video Vision Transformer

3. Sören Auer , Christian Bizer , Georgi Kobilarov , Jens Lehmann , Richard Cyganiak , and Zachary Ives . 2007 . Dbpedia: A nucleus for a web of open data. In The semantic web . Springer , 722--735. Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary Ives. 2007. Dbpedia: A nucleus for a web of open data. In The semantic web. Springer, 722--735.

4. A. Bochkovskiy C. Y. Wang and Hym Liao. 2020. YOLOv4: Optimal Speed and Accuracy of Object Detection. (2020). A. Bochkovskiy C. Y. Wang and Hym Liao. 2020. YOLOv4: Optimal Speed and Accuracy of Object Detection. (2020).

5. Cascade R-CNN: Delving Into High Quality Object Detection

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Action Recognition via Adaptive Semi-Supervised Feature Analysis;Applied Sciences;2023-06-29

2. A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset;Proceedings of the 2023 ACM International Conference on Multimedia Retrieval;2023-06-12

3. Backdoor Defense via Adaptively Splitting Poisoned Dataset;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06

4. Camouflaged Object Detection with Feature Decomposition and Edge Reconstruction;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06