1. Amdal, I., Strand, O.M., Almberg, J., Svendsen, T.: RUNDKAST: an annotated Norwegian broadcast news speech corpus. In: LREC 2008, Marrakech, Morocco. ELRA (2008)
2. Baevski, A., Zhou, Y., Mohamed, A., Auli, M.: Wav2vec 2.0: a framework for self-supervised learning of speech representations. In: NeurIPS 2020, virtual event (2020)
3. Cho, J., et al.: Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling. In: SLT 2018, Athens, Greece, pp. 521–527. IEEE (2018)
4. Chorowski, J., Bahdanau, D., Serdyuk, D., Cho, K., Bengio, Y.: Attention-based models for speech recognition. In: NeurIPS 2015, Montreal, Canada, pp. 577–585 (2015)
5. Conneau, A., et al.: FLEURS: few-shot learning evaluation of universal representations of speech. In: SLT 2022, Doha, Qatar, pp. 798–805. IEEE (2022)