Authors:
Toshniwal Shubham,Kannan Anjuli,Chiu Chung-Cheng,Wu Yonghui,Sainath Tara N,Livescu Karen
Cited by
97 articles.
1. Speaking in Terms of Money: Financial Knowledge Acquisition via Speech Data Generation;ACM Journal on Computing and Sustainable Societies;2024-07-05
2. Improving End-to-End Speech Recognition Through Conditional Cross-Modal Knowledge Distillation with Language Model;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30
3. ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
4. Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
5. Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14