Semantic Image Classification using Deep Neural Network with Bag of Words layer

Author:

Lakshmi S. Jothi1

Affiliation:

1. Kamaraj college of engineering and technology

Abstract

Abstract The Bag of Visual Words (BoVWs) model is an important concept in computer vision that can be used to classify an image using image features as visual words. It represents image as a collection of unordered image patches. It has three independent steps, feature detection, feature description, and codebook generation and they are hard to be joined. It uses local features, it leads to semantic gap. To address all these issues, in our model Bag of Words layer is used in deep neural network to generate category specific visual words. Deep neural network has convolutional layers that extract more features so it also helps to solve semantic gap issue. By this technique CBIR becomes an easy one. The proposed model will retrieve similar images based on the user query image, by analyzing the features or content of the given image. It also uses the inverted file index strategy for image retrieval process. Fast convergence of training procedure, semantically discriminative ability and sparsity of visual words are added to achieve high performance of classification and retrieval process.

Publisher

Research Square Platform LLC

Reference20 articles.

1. D. &. S. H. Nister, “Scalable recognition with a vocabulary tree,” in 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), 2006, June.

2. Z. K. Q. I. M. &. S. J. Wu, “ Bundling features for large scale partial-duplicate web image search,” in 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, June.

3. J. &. Z. A. Sivic, “Video Google: A text retrieval approach to object matching in videos. In Computer Vision,” in IEEE International Conference on IEEE Computer Society, 2003, October.

4. J. R. B. C. E. A. A. Z. A. &. F. W. T. Sivic, in Discovering objects and their location in images. In Tenth IEEE International Conference on Computer Vision (ICCV'05), 2005, October.

5. Distinctive image features from scale-invariant keypoints;Lowe DG;International journal of computer vision,2004

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3