A Hybrid Approach Based on GAN and CNN-LSTM for Aerial Activity Recognition

Published:2023-07-21 Issue:14 Volume:15 Page:3626
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Bousmina Abir¹, Selmi Mouna¹, Ben Rhaiem Mohamed Amine¹², Farah Imed Riadh¹

Affiliation:

1. RIADI Laboratory, National School of Computer Sciences, University of Manouba, Manouba 2010, Tunisia

2. CNCT, National Mapping and Remote Sensing Center, Tunis 2045, Tunisia

Abstract

Unmanned aerial vehicles (UAVs), commonly known as drones, have played a significant role in recent years in creating resilient smart cities. Thanks to their high mobility and reasonable price, UAVs can be used for a wide range of applications, including emergency response, civil protection, search and rescue, and surveillance. Automatic recognition of human activity in aerial videos captured by drones is critical to many of these applications. However, recognition is difficult because of many factors specific to aerial views, including camera motion, vibration, low resolution, background clutter, lighting conditions, and variations in view. Although deep learning approaches have demonstrated their effectiveness in a variety of challenging vision tasks, they require either a large number of labelled aerial videos for training or a dataset with balanced classes, both of which can be difficult to obtain. To address these challenges, we propose a hybrid data augmentation method that combines data transformation with feature augmentation based on a Wasserstein Generative Adversarial Network (GAN). In particular, we apply basic transformation methods to increase the number of videos in the database. A Convolutional Neural Network–Long Short-Term Memory (CNN-LSTM) model is used to learn the spatio-temporal dynamics of actions; a GAN-based technique is then applied to generate synthetic CNN-LSTM features conditioned on action classes, which provide highly discriminative spatio-temporal features. We tested our model on the YouTube aerial database and obtained encouraging results that surpass those of previous state-of-the-art works, with an accuracy of 97.83%.
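
The following is a minimal, illustrative sketch of the three ideas named in the abstract: a basic data transformation, class conditioning of a feature generator, and the Wasserstein critic loss. It is not the authors' implementation. The abstract does not say which transformations, dimensions, or network layers were used, so the horizontal flip, the one-hot conditioning scheme, and every function and parameter name here (`horizontal_flip`, `generator_input`, `critic_loss`, `noise_dim`) are assumptions made only for illustration. The networks themselves are omitted.

```python
import random
from statistics import fmean


def horizontal_flip(clip):
    """Mirror every frame left-to-right; clip[t][y][x] is one pixel value.

    Assumed example of a "basic transformation"; the abstract names none.
    """
    return [[row[::-1] for row in frame] for frame in clip]


def one_hot(label, num_classes):
    """Encode an action-class index as a one-hot vector."""
    return [1.0 if i == label else 0.0 for i in range(num_classes)]


def generator_input(label, num_classes, noise_dim, rng):
    """Concatenate Gaussian noise with a one-hot class code.

    This is one common way to condition a generator on a class; it is
    assumed here, not taken from the paper.
    """
    noise = [rng.gauss(0.0, 1.0) for _ in range(noise_dim)]
    return noise + one_hot(label, num_classes)


def critic_loss(real_scores, fake_scores):
    """Wasserstein critic loss: mean score on fakes minus mean score on reals.

    Minimising it pushes the critic to score real features higher than
    generated ones.
    """
    return fmean(fake_scores) - fmean(real_scores)


# Hypothetical toy data: a one-frame 2x3 "video" and made-up critic scores.
clip = [[[1, 2, 3], [4, 5, 6]]]
flipped = horizontal_flip(clip)
z = generator_input(label=2, num_classes=4, noise_dim=8, rng=random.Random(0))
loss = critic_loss(real_scores=[0.9, 1.1], fake_scores=[-0.2, 0.2])
```

In a full pipeline under these assumptions, `z` would be fed to a generator network that outputs a synthetic CNN-LSTM feature vector for class 2, and the critic scores would come from a critic network rather than fixed lists.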

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/14/3626/pdf

Reference: 63 articles.

1. Involvement of Surveillance Drones in Smart Cities: A Systematic Review;Gohari;IEEE Access,2022

2. Applications of drone in disaster management: A scoping review;Heo;Sci. Justice,2022

3. Autonomous UAV for suspicious action detection using pictorial human pose estimation and classification;Penmetsa;Elcvia Electron. Lett. Comput. Vis. Image Anal.,2014

4. Human action recognition in drone videos using a few aerial training examples;Sultani;Comput. Vis. Image Underst.,2021

5. Data augmentation: A comprehensive survey of modern approaches;Mumuni;Array,2022

Cited by 7 articles.

1. A survey of multimodal hybrid deep learning for computer vision: Architectures, applications, trends, and challenges;Information Fusion;2024-05

2. Detection and Recognition of Voice Commands by a Distributed Acoustic Sensor Based on Phase-Sensitive OTDR in the Smart Home Concept;Sensors;2024-04-03

3. Traffic Sign Recognition and Classification using Deep Neural Networks;Journal of Soft Computing Paradigm;2024-03

4. Cloud Cover Removal from Remote Sensing Data using GANs Based on Attention Mechanism;2023 Seventh International Conference on Image Information Processing (ICIIP);2023-11-22

5. Smart-Data-Glove-Based Gesture Recognition for Amphibious Communication;Micromachines;2023-10-31