Affiliation:
1. School of Information Science and Engineering, University of Jinan, Jinan 250022, China
Abstract
As an important research direction in image and video processing, set-based video recognition requires speed and accuracy. However, the existing static modeling methods focus on computational speed but ignore accuracy, whereas the dynamic modeling methods are higher-accuracy but ignore the computational speed. Combining these two types of methods to obtain fast and accurate recognition results remains a challenging problem. Motivated by this, in this study, a novel Manifolds-based Low-Rank Dictionary Pair Learning (MbLRDPL) method was developed for a set-based video recognition/image set classification task. Specifically, each video or image set was first modeled as a covariance matrix or linear subspace, which can be seen as a point on a Riemannian manifold. Second, the proposed MbLRDPL learned discriminative class-specific synthesis and analysis dictionaries by clearly imposing the nuclear norm on the synthesis dictionaries. The experimental results show that our method achieved the best classification accuracy (100%, 72.16%, 95%) on three datasets with the fastest computing time, reducing the errors of state-of-the-art methods (JMLC, DML, CEBSR) by 0.96–75.69%.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference31 articles.
1. Wang, R., Guo, H., Davis, L.S., and Dai, Q. (2012, January 16–21). Covariance Discriminative Learning: A Natural and Efficient Approach to Image Set Classification. Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.
2. Multi-model fusion metric learning for image set classification;Gao;Knowl. Based Syst.,2019
3. Image Set-Based Collaborative Representation for Face Recognition;Zhu;IEEE Trans. Inf. Forensics Secur.,2014
4. Yang, M., Zhu, P., Van Gool, L., and Zhang, L. (2013, January 22–26). Face recognition based on regularized nearest points between image sets. Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition (FG), Shanghai, China.
5. Auto-encoder based structured dictionary learning for visual classification;Liu;Neurocomputing,2021
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Robust Supervised Spline Embedding;IEEE Transactions on Neural Networks and Learning Systems;2024