IoT-Enabled WBAN and Machine Learning for Speech Emotion Recognition in Patients

Published: 2023-03-08 Issue: 6 Volume: 23 Page: 2948
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Olatinwo Damilola D.¹, Abu-Mahfouz Adnan¹,², Hancke Gerhard¹,³, Myburgh Hermanus¹

Affiliation:

1. Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Pretoria 0001, South Africa

2. Council for Scientific and Industrial Research (CSIR), Pretoria 0184, South Africa

3. Department of Computer Science, City University of Hong Kong, Hong Kong, China

Abstract

An Internet of Things (IoT)-enabled wireless body area network (WBAN) is an emerging technology that combines medical devices, wireless devices, and non-medical devices for healthcare management applications. Speech emotion recognition (SER) is an active research field in both healthcare and machine learning. It is a technique for automatically identifying speakers’ emotions from their speech. However, SER systems, especially in the healthcare domain, face several challenges: low prediction accuracy, high computational complexity, delay in real-time prediction, and difficulty in identifying appropriate features from speech. Motivated by these research gaps, we proposed an emotion-aware IoT-enabled WBAN system within the healthcare framework, in which data processing and long-range data transmission are performed by an edge AI system to predict patients’ speech emotions in real time and to capture changes in emotion before and after treatment. Additionally, we investigated the effectiveness of different machine learning and deep learning algorithms in terms of classification performance, feature extraction methods, and normalization methods. We developed a hybrid deep learning model, combining a convolutional neural network (CNN) with bidirectional long short-term memory (BiLSTM), and a regularized CNN model. We combined the models with different optimization strategies and regularization techniques to improve prediction accuracy, reduce generalization error, and reduce the computational complexity of the neural networks in terms of computational time, power, and space. Several experiments were performed to assess the efficiency and effectiveness of the proposed machine learning and deep learning algorithms. The proposed models were compared with a related existing model for evaluation and validation using standard performance metrics such as prediction accuracy, precision, recall, F1 score, the confusion matrix, and the differences between actual and predicted values. The experimental results showed that one of the proposed models outperformed the existing model, with an accuracy of about 98%.

Funder

Council for Scientific and Industrial Research, Pretoria, South Africa

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/6/2948/pdf

References: 48 articles.

1. A Bibliometric Analysis and Comprehensive Review of Resource Management Challenges in Internet of Things Networks: The Use of Deep Learning; Olatinwo; IEEE Access, 2022

2. A hybrid multi-class MAC protocol for IoT-enabled WBAN systems; Olatinwo; IEEE Sens. J., 2020

3. Developing IoT Based Smart Health Monitoring Systems: A Review; Rahaman; Rev. D’Intell. Artif., 2019