Utilizing Relevant RGB–D Data to Help Recognize RGB Images in the Target Domain-Reference-Cited by-同舟云学术

Utilizing Relevant RGB–D Data to Help Recognize RGB Images in the Target Domain

Published:2019-09-01 Issue:3 Volume:29 Page:611-621
ISSN:2083-8492
Container-title:International Journal of Applied Mathematics and Computer Science
language:en
Short-container-title:

Author:

Gao Depeng¹,Liu Jiafeng¹,Wu Rui¹,Cheng Dansong¹,Fan Xiaopeng¹,Tang Xianglong¹

Affiliation:

1. School of Computer Science and Technology , Harbin Institute of Technology , No. 92 Xidazhi Street, Harbin , China

Abstract

Abstract With the advent of 3D cameras, getting depth information along with RGB images has been facilitated, which is helpful in various computer vision tasks. However, there are two challenges in using these RGB-D images to help recognize RGB images captured by conventional cameras: one is that the depth images are missing at the testing stage, the other is that the training and test data are drawn from different distributions as they are captured using different equipment. To jointly address the two challenges, we propose an asymmetrical transfer learning framework, wherein three classifiers are trained using the RGB and depth images in the source domain and RGB images in the target domain with a structural risk minimization criterion and regularization theory. A cross-modality co-regularizer is used to restrict the two-source classifier in a consistent manner to increase accuracy. Moreover, an L 2,1 norm cross-domain co-regularizer is used to magnify significant visual features and inhibit insignificant ones in the weight vectors of the two RGB classifiers. Thus, using the cross-modality and cross-domain co-regularizer, the knowledge of RGB-D images in the source domain is transferred to the target domain to improve the target classifier. The results of the experiment show that the proposed method is one of the most effective ones.

Publisher

Walter de Gruyter GmbH

Subject

Applied Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.sciendo.com/pdf/10.2478/amcs-2019-0045

Reference41 articles.

1. Argyriou, A., Evgeniou, T. and Pontil, M. (2008). Convex multi-task feature learning, Machine Learning73(3): 243–272.

2. Axler, S. (1997). Linear Algebra Done Right, Undergraduate Texts in Mathematics, Vol. 2, Springer, New York, NY.

3. Belkin, M., Niyogi, P. and Sindhwani, V. (2006). Manifold regularization: A geometric framework for learning from labeled and unlabeled examples, Journal of Machine Learning and Research7: 2399–2434.

4. Bo, L., Ren, X. and Fox, D. (2013). Multipath sparse coding using hierarchical matching pursuit, 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, pp. 660–667.

5. Chen, L., Li, W. and Xu, D. (2014). Recognizing RGB images by learning from RGB-D data, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, pp. 1418–1425.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A single image deblurring approach based on a fractional order dark channel prior;International Journal of Applied Mathematics and Computer Science;2022