How Deep Features Have Improved Event Recognition in Multimedia-Reference-Cited by-同舟云学术

How Deep Features Have Improved Event Recognition in Multimedia

Published:2019-06-14 Issue:2 Volume:15 Page:1-27
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Ahmad Kashif¹,Conci Nicola¹

Affiliation:

1. University of Trento, Trento, Italy

Abstract

Event recognition is one of the areas in multimedia that is attracting great attention of researchers. Being applicable in a wide range of applications, from personal to collective events, a number of interesting solutions for event recognition using multimedia information sources have been proposed. On the other hand, following their immense success in classification, object recognition, and detection, deep learning has been shown to perform well in event recognition tasks also. Thus, a large portion of the literature on event analysis relies nowadays on deep learning architectures. In this article, we provide an extensive overview of the existing literature in this field, analyzing how deep features and deep learning architectures have changed the performance of event recognition frameworks. The literature on event-based analysis of multimedia contents can be categorized into four groups, namely (i) event recognition in single images; (ii) event recognition in personal photo collections; (iii) event recognition in videos; and (iv) event recognition in audio recordings. In this article, we extensively review different deep-learning-based frameworks for event recognition in these four domains. Furthermore, we also review some benchmark datasets made available to the scientific community to validate novel event recognition pipelines. In the final part of the manuscript, we also provide a detailed discussion on basic insights gathered from the literature review, and identify future trends and challenges.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3306240

Reference170 articles.

1. Sharath Adavanne Giambattista Parascandolo Pasi Pertilä Toni Heittola and Tuomas Virtanen. 2017. Sound event detection in multichannel audio using spatial and harmonic features. arXiv preprint arXiv:1706.02293 (2017). Sharath Adavanne Giambattista Parascandolo Pasi Pertilä Toni Heittola and Tuomas Virtanen. 2017. Sound event detection in multichannel audio using spatial and harmonic features. arXiv preprint arXiv:1706.02293 (2017).

2. Sharath Adavanne Archontis Politis and Tuomas Virtanen. 2018. Multichannel sound event detection using 3D convolutional neural networks for learning inter-channel features. arXiv preprint arXiv:1801.09522 (2018). Sharath Adavanne Archontis Politis and Tuomas Virtanen. 2018. Multichannel sound event detection using 3D convolutional neural networks for learning inter-channel features. arXiv preprint arXiv:1801.09522 (2018).

3. USED

4. Event recognition in personal photo collections via multiple instance learning-based classification of multiple images

5. A saliency-based approach to event recognition

Cited by 38 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Explainable event recognition;Multimedia Tools and Applications;2023-03-30

2. Machine Learning Applications for Renewable Energy Systems;Advances in Artificial Intelligence for Renewable Energy Systems and Energy Autonomy;2023

3. Role of Social Media Imagery in Disaster Informatics;International Handbook of Disaster Research;2023

4. Role of Social Media Imagery in Disaster Informatics;International Handbook of Disaster Research;2023

5. Self-supervised and semi-supervised learning for road condition estimation from distributed road-side cameras;Scientific Reports;2022-12-26