Author:
Diba Ali,Fayyaz Mohsen,Sharma Vivek,Paluri Manohar,Gall Jürgen,Stiefelhagen Rainer,Van Gool Luc
Publisher
Springer International Publishing
Reference64 articles.
1. Google Vision AI API. cloud.google.com/vision
2. Sensifai Video Tagging API. www.sensifai.com
3. Abu-El-Haija, S., et al.: Youtube-8m: a large-scale video classification benchmark. arXiv:1609.08675 (2016)
4. Andriluka, M., Pishchulin, L., Gehler, P., Schiele, B.: 2D human pose estimation: new benchmark and state of the art analysis. In: CVPR (2014)
5. Caba Heilbron, F., Escorcia, V., Ghanem, B., Carlos Niebles, J.: ActivityNet: a large-scale video benchmark for human activity understanding. In: CVPR (2015)
Cited by
54 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. MM-AU:Towards Multimodal Understanding of Advertisement Videos;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26
2. ChinaOpen: A Dataset for Open-world Multimodal Learning;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26
3. Video Turing Test: A first step towards human‐level AI;AI Magazine;2023-10-17
4. Spatio-Temporal Convolution-Attention Video Network;2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW);2023-10-02
5. Traffic Incident Database with Multiple Labels Including Various Perspective Environmental Information;2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS);2023-10-01