Authors:
Shi Bowen, Hsu Wei-Ning, Mohamed Abdelrahman
Cited by
46 articles.
1. 3D facial animation driven by speech-video dual-modal signals;Complex & Intelligent Systems;2024-05-23
2. Sla-former: conformer using shifted linear attention for audio-visual speech recognition;Complex & Intelligent Systems;2024-05-18
3. Robust Dual-Modal Speech Keyword Spotting for XR Headsets;IEEE Transactions on Visualization and Computer Graphics;2024-05
4. ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
5. SlideSpeech: A Large Scale Slide-Enriched Audio-Visual Corpus;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14