Author:
Yoshioka Takuya,Erdogan Hakan,Chen Zhuo,Alleva Fil
Cited by
79 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. T-SOT FNT: Streaming Multi-Talker ASR with Text-Only Domain Adaptation Capability;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
2. Automatic Channel Selection and Spatial Feature Integration for Multi-Channel Speech Recognition Across Various Array Topologies;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. Cross-Speaker Encoding Network for Multi-Talker Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
4. LAVSS: Location-Guided Audio-Visual Spatial Audio Separation;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03
5. Spatially Selective Speaker Separation Using a DNN With a Location Dependent Feature Extraction;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024