1. Modeling the shape of the scene: a holistic representation of the spatial envelope;Oliva;Int. J. Comput. Vis.,2001
2. What, where and who? Classifying events by scene and object recognition;Li,2007
3. Automatically annotating the Mir Flickr dataset: experimental protocols, openly available data and semantic spaces;Hare,2010
4. Microsoft coco: common objects in context;Lin,2014
5. Topic modeling of multimodal data: an autoregressive approach;Zheng,2014