Deep Learning-Based Speech Enhancement With a Loss Trading Off the Speech Distortion and the Noise Residue for Cochlear Implants

Published: 2021-11-08 Volume: 8
ISSN: 2296-858X
Container-title: Frontiers in Medicine
Short-container-title: Front. Med.

Author:

Kang Yuyong, Zheng Nengheng, Meng Qinglin

Abstract

The cochlea plays a key role in converting acoustic vibration into the neural stimulation from which the brain perceives sound. A cochlear implant (CI) is an auditory prosthesis that replaces damaged cochlear hair cells to achieve acoustic-to-neural conversion. However, the CI is a very coarse bionic imitation of the normal cochlea. The highly resolved time-frequency-intensity information transmitted by the normal cochlea, which is vital to high-quality auditory perception such as speech perception in challenging environments, cannot be guaranteed by CIs. Although CI recipients with state-of-the-art commercial CI devices achieve good speech perception in quiet backgrounds, they usually suffer from poor speech perception in noisy environments. Therefore, noise suppression or speech enhancement (SE) is one of the most important technologies for CIs. In this study, we introduce recent progress in deep learning (DL)-based, mostly neural network (NN)-based, SE front ends for CIs, and discuss how the hearing properties of CI recipients could be utilized to optimize DL-based SE. In particular, different loss functions are introduced to supervise the NN training, and a set of objective and subjective experiments is presented. The results verify that CI recipients are more sensitive to residual noise than to SE-induced speech distortion, which has been common knowledge in CI research. Furthermore, speech reception threshold (SRT) in noise tests demonstrate that the intelligibility of the denoised speech can be significantly improved when the NN is trained with a loss function biased toward more noise suppression, compared with a loss function that gives equal attention to noise residue and speech distortion.
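
The trade-off described in the abstract can be illustrated with a minimal sketch. It assumes a mask-based enhancer and a loss formed as a weighted sum of two terms: a speech-distortion term (how much the mask alters the clean speech) and a noise-residue term (how much noise passes through the mask). The function name `tradeoff_loss`, the weight `alpha`, and the mean-squared form are assumptions made for illustration; they are not taken from the paper and may differ from the loss the authors actually used.

```python
def tradeoff_loss(mask, speech_mag, noise_mag, alpha=0.5):
    """Illustrative weighted loss (hypothetical, not the paper's definition).

    mask       -- per-bin gains applied by the enhancer
    speech_mag -- clean-speech magnitudes for the same bins
    noise_mag  -- noise magnitudes for the same bins
    alpha      -- weight on speech distortion; (1 - alpha) goes to noise residue
    """
    if not mask or not (len(mask) == len(speech_mag) == len(noise_mag)):
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(mask)
    # Speech distortion: mean squared change the mask makes to the clean speech.
    distortion = sum((m * s - s) ** 2 for m, s in zip(mask, speech_mag)) / n
    # Noise residue: mean squared noise that survives the mask.
    residue = sum((m * v) ** 2 for m, v in zip(mask, noise_mag)) / n
    return alpha * distortion + (1.0 - alpha) * residue


speech = [1.0, 1.0]
noise = [2.0, 2.0]
pass_all = [1.0, 1.0]   # no distortion, all noise kept
block_all = [0.0, 0.0]  # no residue, all speech removed

equal_weight = tradeoff_loss(pass_all, speech, noise, alpha=0.5)
noise_biased = tradeoff_loss(pass_all, speech, noise, alpha=0.2)
```

In this sketch, `alpha = 0.5` corresponds to equal attention on the two terms, while a smaller `alpha` penalizes residual noise more heavily, which is the direction of bias the abstract reports as beneficial for CI recipients.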

Publisher

Frontiers Media SA

Subject

General Medicine

References: 42 articles.

1. Cochlear implantation: an overview;Deep;JNLS B.,2019

2. Spoken word recognition in noise in Mandarin-speaking pediatric cochlear implant users;Ren;Int J Pediatr Otorhinolaryngol.,2018

3. Speech perception of elderly cochlear implant users under different noise conditions;Hast;Otol Neurotol.,2015

4. A review of stimulating strategies for cochlear implants;Choi

Cited by 9 articles.

1. Deep learning restores speech intelligibility in multi-talker interference for cochlear implant users;Scientific Reports;2024-06-09

2. Recovering speech intelligibility with deep learning and multiple microphones in noisy-reverberant situations for people using cochlear implants;The Journal of the Acoustical Society of America;2024-06-01

3. Exploring the performance of automatic speaker recognition using twin speech and deep learning-based artificial neural networks;Frontiers in Artificial Intelligence;2024-02-08

4. Low-frequency band gap design of acoustic metamaterial based on cochlear structure;Smart Materials and Structures;2024-01-18

5. Speech Enhancement Based on a Joint Two-Stage CRN+DNN-DEC Model and a New Constrained Phase-Sensitive Magnitude Ratio Mask;IEEE Access;2024