Funder
Natural Science Foundation of Ningbo Municipality
National Natural Science Foundation of China
Fundamental Research Funds for the Central Universities
Subject
Artificial Intelligence,Cognitive Neuroscience,Computer Science Applications
Reference55 articles.
1. End-to-end audio-visual speech recognition with conformers;Ma,2021
2. Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation;Ephrat,2018
3. R. Arandjelovic, A. Zisserman, Look, listen and learn, in: Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 609–617.
4. Y. Tian, J. Shi, B. Li, Z. Duan, C. Xu, Audio-visual event localization in unconstrained videos, in: Proceedings of the European Conference on Computer Vision, ECCV, 2018, pp. 247–263.
5. Cooperative learning of audio and video models from self-supervised synchronization;Korbar;Adv. Neural Inf. Process. Syst.,2018
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献