Utilizing Google Images for Training Classifiers in CRF-Based Semantic Segmentation
-
Published:2016-05-19
Issue:3
Volume:20
Page:455-461
-
ISSN:1883-8014
-
Container-title:Journal of Advanced Computational Intelligence and Intelligent Informatics
-
language:en
-
Short-container-title:JACIII
Author:
Rangkuti Rizki Perdana, ,Dewanto Vektor,Aprinaldi ,Jatmiko Wisnu,
Abstract
One promising approach to pixel-wise semantic segmentation is based on conditional random fields (CRFs). CRF-based semantic segmentation requires ground-truth annotations to supervisedly train the classifier that generates unary potentials. However, the number of (public) annotation data for training is limitedly small. We observe that the Internet can provide relevant images for any given keywords. Our idea is to convert keyword-related images to pixel-wise annotated images, then use them as training data. In particular, we rely on saliency filters to identify the salient object (foreground) of a retrieved image, which mostly agrees with the given keyword. We utilize saliency information for back-and-foreground CRF-based semantic segmentation to further obtain pixel-wise ground-truth annotations. Experiment results show that training data from Google images improves both the learning performance and the accuracy of semantic segmentation. This suggests that our proposed method is promising for harvesting substantial training data from the Internet for training the classifier in CRF-based semantic segmentation.
Publisher
Fuji Technology Press Ltd.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Human-Computer Interaction
Reference11 articles.
1. J. Shotton, J. Winn, C. Rother, and A. Criminisi, “TextonBoostfor Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context,” Int. J. Comput. Vision, Vol.81, No.1, pp. 2-23, 2009. 2. P. Kohli, L. Ladickacutey, and P. H. Torr, “Robust Higher Order Potentials for Enforcing Label Consistency,” Int. J. Comput. Vision, Vol.82, No.3, pp. 302-324, 2009. 3. P. Kra"henbühl and V. Koltun, “Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials,” Advances in Neural Information Processing Systems, Vol.24, pp. 109-117, Curran Associates, Inc., 2011. 4. M. Szummer, P. Kohli, and D. Hoiem, “Learning CRFs using Graph Cuts,” European Conf. on Computer Vision, 2008. 5. T. Joachims, T. Hofmann, Y. Yue, and C. N. Yu, “Predicting Structured Objects with Support Vector Machines,” Communications of the ACM, Research Highlight, Vol.52, No.11, pp. 97-104, 2009.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|