Author:
Chen Ziyang,Fouhey David F.,Owens Andrew
Publisher
Springer Nature Switzerland
Reference89 articles.
1. Time delay estimation for speaker localization using cnn-based parametrized gcc-phat features
2. Adavanne, S., Politis, A., Virtanen, T.: Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network. In: 2018 26th European Signal Processing Conference (EUSIPCO), pp. 1462–1466. IEEE (2018)
3. Afouras, T., Chung, J.S., Zisserman, A.: The conversation: deep audio-visual speech enhancement. arXiv preprint arXiv:1804.04121 (2018)
4. Arandjelović, R., Zisserman, A.: Objects that sound. arXiv preprint arXiv:1712.06651 (2017)
5. Bian, Z., Jabri, A., Efros, A.A., Owens, A.: Learning pixel trajectories with multiscale contrastive random walks. arXiv (2022)
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. The Un-Kidnappable Robot: Acoustic Localization of Sneaking People;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13
2. Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01
4. Audio-Visual Class-Incremental Learning;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01
5. Self-Supervised Video Forensics by Audio-Visual Anomaly Detection;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06