Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning-Reference-Cited by-同舟云学术

Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning

Published:2022-04-04 Issue: Volume:2022 Page:1-15
ISSN:1748-6718
Container-title:Computational and Mathematical Methods in Medicine
language:en
Short-container-title:Computational and Mathematical Methods in Medicine

Author:

Alghamdi Norah Saleh¹^ORCID,Zakariah Mohammed²^ORCID,Hoang Vinh Truong³^ORCID,Elahi Mohammad Mamun⁴^ORCID

Affiliation:

1. Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O.Box 84428, Riyadh 11671, Saudi Arabia

2. Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 57168, Riyadh 21574, Saudi Arabia

3. Faculty of Computer Science, Ho Chi Minh City Open University, 97 Vo Van Tan, Ward Vo Thi Sau, District 3. Ho Chi Minh City: 70000, Vietnam

4. Department of Computer Science and Engineering, United International University, Dhaka, Bangladesh

Abstract

Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson’s disease, multiple sclerosis, myasthenia gravis, or ALS. Voice problems can be caused by disorders that affect the corticospinal system, cerebellum, basal ganglia, and upper or lower motoneurons. According to a new study, voice pathology detection technologies can successfully aid in the assessment of voice irregularities and enable the early diagnosis of voice pathology. In this paper, we offer two deep-learning-based computational models, 1-dimensional convolutional neural network (1D CNN) and 2-dimensional convolutional neural network (2D CNN), that simultaneously detect voice pathologies caused by neurological illnesses or other causes. From the German corpus Saarbruecken Voice Database (SVD), we used voice recordings of sustained vowel /a/ generated at normal pitch. The collected voice signals are padded and segmented to maintain homogeneity and increase the number of samples. Convolutional layers are applied to raw data, and MFCC features are extracted in this project. Although the 1D CNN had the maximum accuracy of 93.11% on test data, model training produced overfitting and 2D CNN, which generalized the data better and had lower train and validation loss despite having an accuracy of 84.17% on test data. Also, 2D CNN outperforms state-of-the-art studies in the field, implying that a model trained on handcrafted features is better for speech processing than a model that extracts features directly.

Funder

Princess Nourah Bint Abdulrahman University

Publisher

Hindawi Limited

Subject

Applied Mathematics,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,Modeling and Simulation,General Medicine

Link

http://downloads.hindawi.com/journals/cmmm/2022/4364186.pdf

Reference57 articles.

1. A Survey of Voice Pathology Surveillance Systems Based on Internet of Things and Machine Learning Algorithms

2. Changes in acoustic characteristics of the voice across the life span: Measures from individuals 4--93 years of age;E. T. Stathopoulos;Journal of Speech, Language, and Hearing Research,2011

3. Parkinson’s disease: clinical features and diagnosis;J. Jankovic;Journal of neurology, neurosurgery & psychiatry,2008

4. Accurate telemonitoring of Parkinson’s disease progression by non-invasive speech tests

5. Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Sound as a bell: a deep learning approach for health status classification through speech acoustic biomarkers;Chinese Medicine;2024-07-24

2. ORG-RGRU: An automated diagnosed model for multiple diseases by heuristically based optimized deep learning using speech/voice signal;Biomedical Signal Processing and Control;2024-02

3. Prediction Model for Unfavorable Outcome in Spontaneous Intracerebral Hemorrhage Based on Machine Learning;Journal of Korean Neurosurgical Society;2024-01-01

4. Retracted: Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning;Computational and Mathematical Methods in Medicine;2023-12-13

5. Predicting the Remaining Time before Earthquake Occurrence Based on Mel Spectrogram Features Extraction and Ensemble Learning;Applied Sciences;2023-11-13