Publisher
Springer Nature Switzerland
Reference39 articles.
1. Arandjelovic, R., Zisserman, A.: Look, listen and learn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 609–617 (2017)
2. Arandjelovic, R., Zisserman, A.: Objects that sound. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 435–451 (2018)
3. Aytar, Y., Vondrick, C., Torralba, A.: Soundnet: learning sound representations from unlabeled video. Adv. Neural Inf. Processing Syst. 29 (2016)
4. Azulay, A., Weiss, Y.: Why do deep convolutional networks generalize so poorly to small image transformations? arXiv preprint arXiv:1805.12177 (2018)
5. Cai, Z., et al.: A unified multi-scale deep convolutional neural network for fast object detection. In: 14th European Conference on Computer Vision, pp. 354–370 (2016)