1. J. Sivic, A. Zisserman. Video Google: a text retrieval approach to object matching in videos, in: Proceedings of the IEEE International Conference on Computer Vision, vol. 2, 2003, pp. 1470–1477.
2. R. Ohbuchi, T. Takei. Shape-similarity comparison of 3D models using alpha shapes, in: Proceedings of the Pacific Graphics, 2003, pp. 293–302.
3. S. Savarese, J. Winn, A. Criminisi. Discriminative object class models of appearance and shape by correlatons, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, 2006, pp. 2033–2040.
4. S. Wong, T. Kim, R. Cipolla. Learning motion categories using both semantic and structural information, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2007, pp. 1–6.
5. P. Dollar, V. Rabaud, G. Cottrell, S. Belongie. Behavior recognition via sparse spatio-temporal features, in: Proceedings of the Second Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005, pp. 65–72.