1. FastInject: Injecting Unpaired Text Data into CTC-Based ASR Training;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
2. Audio-Text Multimodal Speech Recognition via Dual-Tower Architecture for Mandarin Air Traffic Control Communications;Computers, Materials & Continua;2024
3. SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
4. End-to-End Speech Recognition: A Survey;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
5. Can Unpaired Textual Data Replace Synthetic Speech in ASR Model Adaptation?;2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU);2023-12-16