1. Yusuf Aytar , Carl Vondrick , and Antonio Torralba . 2016 . Soundnet: Learning sound representations from unlabeled video. Advances in neural information processing systems (2016). Yusuf Aytar, Carl Vondrick, and Antonio Torralba. 2016. Soundnet: Learning sound representations from unlabeled video. Advances in neural information processing systems (2016).
2. Visual saliency model for robot cameras
3. Zoya Bylinskii , Tilke Judd , Aude Oliva , Antonio Torralba , and Frédo Durand . 2018. What do different evaluation metrics tell us about saliency models?IEEE transactions on pattern analysis and machine intelligence ( 2018 ). Zoya Bylinskii, Tilke Judd, Aude Oliva, Antonio Torralba, and Frédo Durand. 2018. What do different evaluation metrics tell us about saliency models?IEEE transactions on pattern analysis and machine intelligence (2018).
4. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
5. Jiazhong Chen , Qingqing Li , Hefei Ling , Dakai Ren , and Ping Duan . 2021. Audiovisual saliency prediction via deep learning. Neurocomputing ( 2021 ). Jiazhong Chen, Qingqing Li, Hefei Ling, Dakai Ren, and Ping Duan. 2021. Audiovisual saliency prediction via deep learning. Neurocomputing (2021).