1. van den Oord, A., et al.: WaveNet: a generative model for raw audio.
http://arxiv.org/abs/1609.03499
. Accessed 26 Nov 2019
2. Amodei, D., et al.: Deep speech 2: end-to-end speech recognition in English and mandarin.
http://arxiv.org/abs/1512.02595
. Accessed 13 Jan 2020
3. Collobert, R., Puhrsch, C., Synnaeve, G.: Wav2Letter: an end-to-end convnet-based speech recognition system.
http://arxiv.org/abs/1609.03193
. Accessed 15 Jan 2020
4. Prenger, R., Valle, R., Catanzaro, B.: Waveglow: a flow-based generative network for speech synthesis. In: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pp. 3617–3621 (2019).
https://doi.org/10.1109/icassp.2019.8683143
5. Spanias, A., Painter, T., Atti, V.: Audio Signal Processing and Coding. Wiley, Hoboken (2007)