1. Daniilidis, K., Maragos, P., and Paragios, N. (2010, January 5–11). Every Picture Tells a Story: Generating Sentences from Images. Proceedings of the Computer Vision—ECCV 2010, Crete, Greece.
2. Kulkarni, G., Premraj, V., Dhar, S., Li, S., Choi, Y., Berg, A.C., and Berg, T.L. (2011, January 20–25). Baby Talk: Understanding and Generating Simple Image Descriptions. Proceedings of the CVPR 2011, Colorado Springs, CO, USA.
3. Mitchell, M., Han, X., Dodge, J., Mensch, A., Goyal, A., Berg, A., Yamaguchi, K., Berg, T., Stratos, K., and Daumé, H. (2012, January 23–27). Midge: Generating Image Descriptions from Computer Vision Detections. Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France.
4. Kuznetsova, P., Ordonez, V., Berg, A., Berg, T., and Choi, Y. (2012, January 8–14). Collective Generation of Natural Image Descriptions. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jeju Island, Republic of Korea.
5. TreeTalk: Composition and Compression of Trees for Image Descriptions;Kuznetsova;Trans. Assoc. Comput. Linguist.,2014