Authors:
Zhou Junfeng, Wang Feng, Guo Di, Liu Huaping, Sun Fuchun
Publisher
Springer International Publishing
References: 24 articles.
1. Zhao, H., Gan, C., Rouditchenko, A., Vondrick, C., McDermott, J., Torralba, A.: The sound of pixels. arXiv preprint arXiv:1804.03160 (2018)
2. Owens, A., Efros, A.A.: Audio-visual scene analysis with self-supervised multisensory features. arXiv preprint arXiv:1804.03641 (2018)
3. Segev, D., Schechner, Y.Y., Elad, M.: Example-based cross-modal denoising. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 486–493. IEEE (2012)
4. Gemmeke, J.F., et al.: Audio set: an ontology and human-labeled dataset for audio events. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 776–780. IEEE (2017)
5. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Cited by
1 article.
1. Self-Supervised Learning for Alignment of Objects and Sound. 2020 IEEE International Conference on Robotics and Automation (ICRA), 2020-05