Author:
Le Minh Tu,Ta Bao Thang,Le Nhat Minh,Nguyen Phi Le,Do Van Hai
Publisher
Springer Nature Singapore
Reference29 articles.
1. Burnham, D., et al.: Building an audio-visual corpus of Australian English: large corpus collection with an economical portable and replicable black box. In: Proceedings of Interspeech 2011, pp. 841–844 (2011). https://doi.org/10.21437/Interspeech.2011-309
2. Demirsahin, I., Kjartansson, O., Gutkin, A., Rivera, C.: Open-source multi-speaker corpora of the English accents in the British isles. In: Proceedings of the Twelfth Language Resources and Evaluation Conference, pp. 6532–6541 (2020)
3. Desplanques, B., Thienpondt, J., Demuynck, K.: ECAPA-TDNN: emphasized channel attention, propagation and aggregation in TDNN based speaker verification. In: Proceedings of Interspeech 2020, pp. 3830–3834 (2020). https://doi.org/10.21437/Interspeech.2020-2650
4. Gong, W., Wang, J., Liu, Y., Yang, H.: A no-reference speech quality assessment method based on neural network with densely connected convolutional architecture. In: Proceedings of INTERSPEECH 2023, pp. 536–540 (2023). https://doi.org/10.21437/Interspeech.2023-811
5. Gulati, A., et al.: Conformer: convolution-augmented transformer for speech recognition. In: Proceedings of Interspeech 2020, pp. 5036–5040 (2020). https://doi.org/10.21437/Interspeech.2020-3015