Abstract
AbstractCochlear implants (CIs) have proven to be successful at restoring the sensation of hearing in people who suffer from profound sensorineural hearing loss. CI users generally achieve good speech understanding in quiet acoustic conditions. However, their ability to understand speech degrades drastically when background interfering noise is present. To address this problem, current CI systems are delivered with front-end speech enhancement modules that can aid the listener in noisy environments. However, these only perform well under certain noisy conditions, leaving quite some room for improvement in more challenging circumstances. In this work, we propose replacing the CI sound coding strategy with a deep neural network (DNN) that performs end-to-end speech denoising by taking the raw audio as input and providing a denoised electrodogram, i.e., the electrical stimulation patterns applied to the electrodes across time. We specifically introduce a DNN that emulates a common CI sound coding strategy, the advanced combination encoder (ACE). We refer to the proposed algorithm as ‘Deep ACE’. Deep ACE is designed not only to accurately code the acoustic signals in the same way that ACE would but also to automatically remove unwanted interfering noises, without sacrificing processing latency. The model was optimized using a CI-specific loss function and evaluated using objective measures as well as listening tests in CI participants. Results show that, based on objective measures, the proposed model achieved higher scores when compared to the baseline algorithms. Also, the proposed deep learning-based sound coding strategy gave eight CI users the highest speech intelligibility results.
Publisher
Cold Spring Harbor Laboratory
Reference44 articles.
1. Sound coding in cochlear implants: From electric pulses to hearing;IEEE Signal Processing Magazine,2015
2. Better speech recognition with cochlear implants
3. Architecture of the spectra 22 speech processor;Annals of Otology, Rhinology and Laryngology,1995
4. A psychoacoustic “NofM”-type speech coding strategy for cochlear implants;EURASIP Journal on Advances in Signal Processing,2005
5. Effects of noise and noise suppression on speech perception by CI users;Ear and Hearing,1992