1. BembaSpeech: a speech recognition corpus for the Bemba language;Sikasote,2022
2. LIG-AIKUMA: a mobile app to collect parallel speech for under-resourced language studies;Gauthier,2016
3. Towards building ASR systems for the next billion users;Javed;CoRR,2021
4. A. Virkkunen, A. Rouhe, N. Phan, M. Kurimo, and A. Virkkunen anjavirkkunen, “Finnish parliament ASR corpus analysis, benchmarks and statistics · Parliament speech data · HMM-DNN · AED · Wav2vec · Metadata,” Lang. Resour. Eval., 123AD, doi:10.1007/s10579-023-09650-7.
5. Cross-lingual language model pretraining;Conneau,2019