Affiliation:
1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
Abstract
Zero Shot learning (ZSL) aims to use the information of seen classes to recognize unseen classes, which is achieved by transferring knowledge of the seen classes from the semantic embeddings. Since the domains of the seen and unseen classes do not overlap, most ZSL algorithms often suffer from domain shift problem. In this paper, we propose a Dual Discriminative Auto-encoder Network (DDANet), in which visual features and semantic attributes are self-encoded by using the high dimensional latent space instead of the feature space or the low dimensional semantic space. In the embedded latent space, the features are projected to both preserve their original semantic meanings and have discriminative characteristics, which are realized by applying dual semantic auto-encoder and discriminative feature embedding strategy. Moreover, the cross modal reconstruction is applied to obtain interactive information. Extensive experiments are conducted on four popular datasets and the results demonstrate the superiority of this method.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference31 articles.
1. Akata Zeynep , Perronnin Florent , Harchaoui Zaid and Schmid Cordelia , Label-embedding for attribute-based classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 819–826, 2013.
2. Akata Zeynep , Reed Scott , Walter Daniel , Lee Honglak and Schiele Bernt , Evaluation of output embeddings for finegrained image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2927–2936, 2015.
3. Annadani Yashas and Biswas Soma , Preserving semantic relations for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 7603–7612, 2018.
4. Chao Wei-Lun , Changpinyo Soravit , Gong Boqing and Sha Fei , An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. In European Conference on Computer Vision, pages 52–68. Springer, 2016.
5. Deng Jia , Dong Wei , Socher Richard , Li Li-Jia , Li Kai and Fei-Fei Li , Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献