Author:
Singh Naorem Karline,Chanu Yambem Jina,Pangsatabam Hoomexsun
Publisher
Springer Nature Singapore
Reference18 articles.
1. Kanda N, Takeda R, Obuchi Y (2013) Elastic spectral distortion for low resource speech recognition with deep neural networks. In: 2013 IEEE Workshop on automatic speech recognition and understanding. IEEE, pp 309–314
2. Ragni A, Knill KM, Rath SP, Gales MJF (2014) Data augmentation for low resource languages. In: INTERSPEECH 2014: 15th Annual conference of the international speech communication association, Singapore. International Speech Communication Association (ISCA), pp 810–814
3. Jaitly N, Hinton GE (2013) Vocal tract length perturbation (VTLP) improves speech recognition. In: Proceedings of the 30th international conference on machine learning. JMLR:W&CP, vol 28, pp 1–5
4. Ko T, Peddinti V, Povey D, Khudanpur S (2015) Audio augmentation for speech recognition. In: INTERSPEECH 2015, pp 3586–3589
5. Mallidi SH, Hermansky H (2016) Novel neural network based fusion for multistream ASR. In: 2016 IEEE International conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 5680–5684