An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Published:2023-05-30 Issue:11 Volume:23 Page:5196
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Contreras Rodrigo Colnago¹, Viana Monique Simplicio², Fonseca Everthon Silva², dos Santos Francisco Lledo³, Zanin Rodrigo Bruno³, Guido Rodrigo Capobianco¹

Affiliation:

1. Department of Computer Science and Statistics, Institute of Biosciences, Letters and Exact Sciences, São Paulo State University, São José do Rio Preto 15054-000, SP, Brazil

2. Federal Institute of São Paulo, São José do Rio Preto 15030-070, SP, Brazil

3. Faculty of Architecture and Engineering, Mato Grosso State University, Cáceres 78217-900, MT, Brazil

Abstract

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. Commonplace examples include access to the workplace or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the large body of literature and software packages available for use. However, the ability of this biometric to represent the individual may be impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, a user with the flu, for example, may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and their representation capacity is evaluated on three classifiers. Finally, experiments on a subset of the Saarbruecken Voice Database demonstrate the effectiveness of the proposed approach in detecting the presence of dysphonia in the voice.

Funder

National Council for Scientific and Technological Development

The State of São Paulo Research Foundation

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/11/5196/pdf

Cited by 2 articles.

1. Automatic Voice Disorder Detection from a Practical Perspective;Journal of Voice;2024-05

2. Metaheuristic Algorithms for Enhancing Multicepstral Representation in Voice Spoofing Detection: An Experimental Approach;Lecture Notes in Computer Science;2024