Deep Learning for Activity Recognition Using Audio and Video-Reference-Cited by-同舟云学术

Deep Learning for Activity Recognition Using Audio and Video

Published:2022-03-03 Issue:5 Volume:11 Page:782
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Reinolds Francisco^ORCID,Neto Cristiana^ORCID,Machado José^ORCID

Abstract

Neural networks have established themselves as powerhouses in what concerns several types of detection, ranging from human activities to their emotions. Several types of analysis exist, and the most popular and successful is video. However, there are other kinds of analysis, which, despite not being used as often, are still promising. In this article, a comparison between audio and video analysis is drawn in an attempt to classify violence detection in real-time streams. This study, which followed the CRISP-DM methodology, made use of several models available through PyTorch in order to test a diverse set of models and achieve robust results. The results obtained proved why video analysis has such prevalence, with the video classification handily outperforming its audio classification counterpart. Whilst the audio models attained on average 76% accuracy, video models secured average scores of 89%, showing a significant difference in performance. This study concluded that the applied methods are quite promising in detecting violence, using both audio and video.

Funder

Fundação para a Ciência e Tecnologia

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/11/5/782/pdf

Reference22 articles.

1. Enabling Cognitive Smart Cities Using Big Data and Machine Learning: Approaches and Challenges

2. Video-Based Detection Infrastructure Enhancement for Automated Ship Recognition and Behavior Analysis

3. A Survey on Human Behavior Recognition Using Smartphone-Based Ultrasonic Signal

4. In-car violence detection based on the audio signal;Santos,2021

5. Review of trends in automatic human activity recognition using synthetic audio-visual data;Jesus,2020

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A review of video-based human activity recognition: theory, methods and applications;Multimedia Tools and Applications;2024-07-10

2. Longitudinal tear detection method for conveyor belt based on multi-mode fusion;Wireless Networks;2024-03-13

3. Review for Augmented Reality Shopping Application for Mobile Systems;Marketing and Smart Technologies;2023-09-05

4. Supervised Video Cloth Simulation: Exploring Softness and Stiffness Variations on Fabric Types Using Deep Learning;Applied Sciences;2023-08-22

5. Enhancing CSI-Based Human Activity Recognition by Edge Detection Techniques;Information;2023-07-14