Abstract
Abstract
In the human ear, the basilar membrane plays a central role in sound recognition. When excited by sound, this membrane responds with a frequency-dependent displacement pattern that is detected and identified by the auditory hair cells combined with the human neural system. Inspired by this structure, we designed and fabricated an artificial membrane that produces a spatial displacement pattern in response to an audible signal, which we used to train a convolutional neural network. When trained with single frequency tones, this system can unambiguously distinguish tones closely spaced in frequency. When instead trained to recognize spoken vowels, this system outperforms existing methods for phoneme recognition, including the discrete Fourier transform, zoom FFT and chirp z-transform, especially when tested in short time windows. This sound recognition scheme therefore promises significant benefits in fast and accurate sound identification compared to existing methods.
Funder
National Research Foundation of Korea
Subject
Engineering (miscellaneous),Molecular Medicine,Biochemistry,Biophysics,Biotechnology
Reference29 articles.
1. Overview: cochlear neurobiology;Dallos,1996
2. Mechanics of the mammalian cochlea;Robles;Physiol. Rev.,2001
3. Longitudinal pattern of basilar membrane vibration in the sensitive cochlea;Ren;Proc. Natl Acad. Sci.,2002
4. Acoustic modeling using deep belief networks;Mohamed;IEEE Trans. Audio Speech Lang. Process.,2012
5. Speech acoustic modeling from raw multichannel waveforms;Hoshen,2015
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献