Author:
Peng Puyuan,Li Shang-Wen,Räsänen Okko,Mohamed Abdelrahman,Harwath David
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Integrating Self-Supervised Speech Model with Pseudo Word-Level Targets from Visually-Grounded Speech Model;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14
2. SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in Hubert;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. Visually Grounded Few-Shot Word Learning in Low-Resource Settings;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
4. Visually Grounded Speech Models Have a Mutual Exclusivity Bias;Transactions of the Association for Computational Linguistics;2024
5. Audio-Visual Neural Syntax Acquisition;2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU);2023-12-16