Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition

Author:

Zhang Weigang,Qi Zhaobo1,Wang Shuhui2,Su Chi3,Su Li4,Huang Qingming5

Affiliation:

1. Harbin Institute of Technology, Weihai, China

2. Institute of Computing Technology, Chinese Academy of Sciences and Peng Cheng Laboratory, China

3. SmartMore, China

4. University of Chinese Academy of Sciences, China

5. University of Chinese Academy of Sciences, Institute of Computing Technology, Chinese Academy of Sciences and Peng Cheng Laboratory, China

Abstract

Recently, with the vigorous development of deep learning and multimedia technology, intelligent urban computing has received more and more extensive attention from academia and industry. Unfortunately, most of the related technologies are black-box paradigms that lack interpretability. Among them, video event recognition is a basic technology. Event contains multiple concepts and their rich interactions, which can assist us to construct explainable event recognition methods. However, the crucial concepts needed to recognize events have various temporal existing patterns, and the relationship between events and the temporal characteristics of concepts has not been fully exploited. This brings great challenges for concept-based event categorization. To address the above issues, we introduce the temporal concept receptive field, which is the length of the temporal window size required to capture key concepts for concept-based event recognition methods. Accordingly, we introduce the temporal dynamic convolution (TDC) to model the temporal concept receptive field dynamically according to different events. Its core idea is to combine the results of multiple convolution layers with the learned coefficients from two complementary perspectives. These convolution layers contain a variety of kernel sizes, which can provide temporal concept receptive fields of different lengths. Similarly, we also propose the cross-domain temporal dynamic convolution (CrTDC) with the help of the rich relationship between different concepts. Different coefficients can help us to capture suitable temporal concept receptive field sizes and highlight crucial concepts, so as to obtain accurate and complete concept representations for event analysis. Based on the TDC and CrTDC, we introduce the temporal dynamic concept modeling network (TDCMN) for explainable video event recognition. We evaluate TDCMN on large-scale and challenging datasets FCVID, ActivityNet, and CCV. Experimental results show that TDCMN significantly improves the event recognition performance of concept-based methods, and the explainability of our method inspires us to construct more explainable models from the perspective of the temporal concept receptive field.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Reference85 articles.

1. How Deep Features Have Improved Event Recognition in Multimedia

2. Ensemble of Deep Models for Event Recognition

3. Recognition of Complex Events: Exploiting Temporal Dynamics between Underlying Concepts

4. Egor Burkov and Victor S. Lempitsky. 2018. Deep Neural Networks with Box Convolutions. In Advances in Neural Information Processing Systems Samy Bengio Hanna M. Wallach Hugo Larochelle Kristen Grauman Nicolò Cesa-Bianchi and Roman Garnett (Eds.). 6214–6224. Egor Burkov and Victor S. Lempitsky. 2018. Deep Neural Networks with Box Convolutions. In Advances in Neural Information Processing Systems Samy Bengio Hanna M. Wallach Hugo Larochelle Kristen Grauman Nicolò Cesa-Bianchi and Roman Garnett (Eds.). 6214–6224.

5. ActivityNet: A large-scale video benchmark for human activity understanding

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3