Publisher
Springer International Publishing
Reference21 articles.
1. OpenASR21 Homepage. https://sat.nist.gov/openasr21
2. Baevski, A., Zhou, Y., Mohamed, A., et al.: wav2vec 2.0: a framework for self-supervised learning of speech representations. Adv. Neural Inf. Process. Syst. 33, 12449–12460 (2020)
3. Ghahremani, P., Manohar, V., Povey, D., et al.: Acoustic modelling from the signal domain using cnns. In: Interspeech, pp. 3434–3438 (2016)
4. Hsu, W.N., Bolte, B., Tsai, Y.H.H., et al.: Hubert: self-supervised speech representation learning by masked prediction of hidden units. IEEE/ACM Trans. Audio Speech Lang. Process. 29, 3451–3460 (2021)
5. Hsu, W.N., Sriram, A., Baevski, A., et al.: Robust wav2vec 2.0: analyzing domain shift in self-supervised pre-training. In: Interspeech, pp. 721–725 (2021)