Affiliation:
1. School of Computer Science and Engineering, Dalian Minzu University, Dalian 116600, China
Abstract
The ever-increasing size of images has made automatic image annotation one of the most important tasks in the fields of machine learning and computer vision. Despite continuous efforts in inventing new annotation algorithms and new models, results of the state-of-the-art image annotation methods are often unsatisfactory. In this paper, to further improve annotation refinement performance, a novel approach based on weighted mutual information to automatically refine the original annotations of images is proposed. Unlike the traditional refinement model using only visual feature, the proposed model use semantic embedding to properly map labels and visual features to a meaningful semantic space. To accurately measure the relevance between the particular image and its original annotations, the proposed model utilize all available information including image-to-image, label-to-label and image-to-label. Experimental results conducted on three typical datasets show not only the validity of the refinement, but also the superiority of the proposed algorithm over existing ones. The improvement largely benefits from our proposed mutual information method and utilizing all available information.
Publisher
North Atlantic University Union (NAUN)
Subject
Electrical and Electronic Engineering,Signal Processing
Reference48 articles.
1. P. K. Bhagat and P. Choudhary, ‘‘Image annotation: Then and now’’ Image Vis. Comput., vol. 80, pp. 1–23, Dec. 2018.
2. Q. Cheng, Q. Zhang, P. Fu, C. Tu, and S. Li, ‘‘A survey and analysis on automatic image annotation,’’ Pattern Recognit., vol. 79, pp. 242–259, Jul. 2018.
3. V. Lavrenko, R. Manmatha, J. Jeon, “A model for learning the semantics of pictures,” In: Advances in neural information processing systems, pp.553–560, 2003
4. S. Feng, R. Manmatha, and V. Lavrenko, “Multiple bernoulli relevance models for image and video annotation,” in Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004, vol. 2. IEEE, 2004, pp.II–II.
5. A. Makadia, V. Pavlovic, and S. Kumar, “Baselines for image annotation,”International Journal of Computer Vision, vol. 90, no. 1, pp.88–105, 2010.