Authors:
Kanda Naoyuki, Gaur Yashesh, Wang Xiaofei, Meng Zhong, Yoshioka Takuya
Cited by
7 articles.
1. One Model to Rule Them All? Towards End-to-End Joint Speaker Diarization and Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
2. T-SOT FNT: Streaming Multi-Talker ASR with Text-Only Domain Adaptation Capability;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. Extending Whisper with Prompt Tuning to Target-Speaker ASR;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
4. Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
5. SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14