1. Robust speech recognition via large-scale weak supervision;Radford;arXiv preprint arXiv:2212.04356,2022
2. Scaling speech technology to 1,000+ languages;Pratap;arXiv preprint arXiv:2305.13516,2023
3. Google usm: Scaling automatic speech recognition beyond 100 languages;Zhang;arXiv preprint arXiv:2303.01037,2023
4. Faster whisper transcription with ctranslate2;Klein;GitHub repository,2023
5. WhisperX: Time-Accurate Speech Transcription of Long-Form Audio