1. Mutan: Multimodal tucker fusion for visual question answering;Ben-Younes,2017
2. Block: Bilinear superdiagonal fusion for visual question answering and visual relationship detection;Ben-Younes,2019
3. Towards a comprehensive computational model for aesthetic assessment of videos;Bhattacharya,2013
4. Large-scale visual sentiment ontology and detectors using adjective noun pairs;Borth,2013
5. From pixels to sentiment: fine-tuning CNNs for visual sentiment prediction;Campos;Image and Vision Computing,2017