Optimizing Multimodal Scene Recognition through Mutual Information-Based Feature Selection in Deep Learning Models-Reference-Cited by-同舟云学术

Optimizing Multimodal Scene Recognition through Mutual Information-Based Feature Selection in Deep Learning Models

Published:2023-10-29 Issue:21 Volume:13 Page:11829
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Hammad Mohamed¹²^ORCID,Chelloug Samia Allaoua³^ORCID,Alayed Walaa³^ORCID,El-Latif Ahmed A. Abd¹⁴⁵^ORCID

Affiliation:

1. EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia

2. Department of Information Technology, Faculty of Computers and Information, Menoufia University, Shebin El Kom 32511, Egypt

3. Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh 11671, Saudi Arabia

4. Faculty of Informatics and Computing, Universiti Sultan Zainal Abidin (UniSZA), Besut Campus, Besut 22200, Malaysia

5. Department of Mathematics and Computer Science, Faculty of Science, Menoufia University, Shebin Elkom 32511, Egypt

Abstract

The field of scene recognition, which lies at the crossroads of computer vision and artificial intelligence, has experienced notable progress because of scholarly pursuits. This article introduces a novel methodology for scene recognition by combining convolutional neural networks (CNNs) with feature selection techniques based on mutual information (MI). The main goal of our study is to address the limitations inherent in conventional unimodal methods, with the aim of improving the precision and dependability of scene classification. The focus of our research is around the formulation of a comprehensive approach for scene detection, utilizing multimodal deep learning methodologies implemented on a solitary input image. Our work distinguishes itself by the innovative amalgamation of CNN- and MI-based feature selection. This integration provides distinct advantages and enhanced capabilities when compared to prevailing methodologies. In order to assess the effectiveness of our methodology, we performed tests on two openly accessible datasets, namely, the scene categorization dataset and the AID dataset. The results of these studies exhibited notable levels of precision, with accuracies of 100% and 98.83% achieved for the corresponding datasets. These findings surpass the performance of other established techniques. The primary objective of our end-to-end approach is to reduce complexity and resource requirements, hence creating a robust framework for the task of scene categorization. This work significantly advances the practical application of computer vision in various real-world scenarios, leading to a large improvement in the accuracy of scene recognition and interpretation.

Funder

Deputyship for Research & Innovation, Ministry of Education, in Saudi Arabia

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/21/11829/pdf

Reference30 articles.

1. Scene recognition: A comprehensive survey;Xie;Pattern Recognit.,2020

2. A survey of advances in vision-based human motion capture and analysis;Moeslund;Comput. Vis. Image Underst.,2006

3. Deep learning for object detection and scene perception in self-driving cars: Survey, challenges, and open issues;Gupta;Array,2021

4. ROSEFusion: Random optimization for online dense reconstruction under fast camera motion;Zhang;ACM Trans. Graph. TOG,2021

5. Graph convolutional network with triplet attention learning for person re-identification;Saber;Inf. Sci.,2022

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Time Series Feature Selection Method Based on Mutual Information;Applied Sciences;2024-02-28