A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs

Published: 2022-07-26 Issue: 15 Volume: 22 Page: 5566
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Wall Conor, Zhang Li, Yu Yonghong, Kumar Akshi, Gao Rong

Abstract

Medical audio classification for lung abnormality diagnosis is a challenging problem owing to comparatively unstructured audio signals present in the respiratory sound clips. To tackle such challenges, we propose an ensemble model by incorporating diverse deep neural networks with attention mechanisms for undertaking lung abnormality and COVID-19 diagnosis using respiratory, speech, and coughing audio inputs. Specifically, four base deep networks are proposed, which include attention-based Convolutional Recurrent Neural Network (A-CRNN), attention-based bidirectional Long Short-Term Memory (A-BiLSTM), attention-based bidirectional Gated Recurrent Unit (A-BiGRU), as well as Convolutional Neural Network (CNN). A Particle Swarm Optimization (PSO) algorithm is used to optimize the training parameters of each network. An ensemble mechanism is used to integrate the outputs of these base networks by averaging the probability predictions of each class. Evaluated using respiratory ICBHI, Coswara breathing, speech, and cough datasets, as well as a combination of ICBHI and Coswara breathing databases, our ensemble model and base networks achieve ICBHI scores ranging from 0.920 to 0.9766. Most importantly, the empirical results indicate that a positive COVID-19 diagnosis can be distinguished to a high degree from other more common respiratory diseases using audio recordings, based on the combined ICBHI and Coswara breathing datasets.
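
The ensemble step described in the abstract (averaging the per-class probability predictions of the base networks) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the three-class setup, and the probability values are hypothetical and chosen only for illustration. The helper `icbhi_score` assumes the usual ICBHI challenge definition, the mean of sensitivity and specificity.

```python
from typing import List, Sequence


def average_ensemble(prob_sets: Sequence[Sequence[float]]) -> List[float]:
    """Average per-class probabilities across base models (soft voting)."""
    if not prob_sets:
        raise ValueError("at least one model output is required")
    n_classes = len(prob_sets[0])
    if any(len(p) != n_classes for p in prob_sets):
        raise ValueError("all models must predict the same number of classes")
    n_models = len(prob_sets)
    return [sum(p[c] for p in prob_sets) / n_models for c in range(n_classes)]


def predict_class(prob_sets: Sequence[Sequence[float]]) -> int:
    """Return the index of the class with the highest averaged probability."""
    avg = average_ensemble(prob_sets)
    return max(range(len(avg)), key=lambda c: avg[c])


def icbhi_score(sensitivity: float, specificity: float) -> float:
    """Assumed definition: mean of sensitivity and specificity."""
    return (sensitivity + specificity) / 2


# Hypothetical softmax outputs of the four base networks for one audio clip
# (made-up numbers, three illustrative classes).
outputs = [
    [0.6, 0.3, 0.1],  # A-CRNN
    [0.5, 0.4, 0.1],  # A-BiLSTM
    [0.2, 0.7, 0.1],  # A-BiGRU
    [0.7, 0.2, 0.1],  # CNN
]
averaged = average_ensemble(outputs)
predicted = predict_class(outputs)
```

In this made-up example one base network (A-BiGRU) favours the second class, but the averaged distribution still favours the first, which shows how averaging can smooth over a single dissenting model.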

Funder

Research England

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/15/5566/pdf

References: 53 articles.

1. Deep learning based melanoma diagnosis using dermoscopic images; Wall, 2020

2. Convolutional neural networks: an overview and application in radiology

3. Classifying Heart Sounds Using Images of Motifs, MFCC and Temporal Features

4. Noise masking recurrent neural network for respiratory sound classification; Kochetov, 2018

5. Gated recurrent unit (GRU) for emotion classification from noisy speech; Rana; arXiv, 2016

Cited by 12 articles.

1. Resilient embedded system for classification respiratory diseases in a real time; Biomedical Signal Processing and Control; 2024-04

2. Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers; Sensors; 2024-01-21

3. Multimedia datasets for anomaly detection: a review; Multimedia Tools and Applications; 2023-12-13

4. Enhanced bare-bones particle swarm optimization based evolving deep neural networks; Expert Systems with Applications; 2023-11

5. Principles of artificial intelligence and its application in cardiovascular medicine; Clinical Cardiology; 2023-09-18