Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies-Reference-Cited by-同舟云学术

Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies

Published:2022-08-18 Issue:5 Volume:19 Page:366-411
ISSN:2731-538X
Container-title:Machine Intelligence Research
language:en
Short-container-title:Mach. Intell. Res.

Author:

Wu Yang^ORCID,Wang Ding-Heng,Lu Xiao-Tong,Yang Fan,Yao Man,Dong Wei-Sheng,Shi Jian-Bo,Li Guo-Qi^ORCID

Abstract

AbstractVisual recognition is currently one of the most important and active research areas in computer vision, pattern recognition, and even the general field of artificial intelligence. It has great fundamental importance and strong industrial needs, particularly the modern deep neural networks (DNNs) and some brain-inspired methodologies, have largely boosted the recognition performance on many concrete tasks, with the help of large amounts of training data and new powerful computation resources. Although recognition accuracy is usually the first concern for new progresses, efficiency is actually rather important and sometimes critical for both academic research and industrial applications. Moreover, insightful views on the opportunities and challenges of efficiency are also highly required for the entire community. While general surveys on the efficiency issue have been done from various perspectives, as far as we are aware, scarcely any of them focused on visual recognition systematically, and thus it is unclear which progresses are applicable to it and what else should be concerned. In this survey, we present the review of recent advances with our suggestions on the new possible directions towards improving the efficiency of DNN-related and brain-inspired visual recognition approaches, including efficient network compression and dynamic brain-inspired networks. We investigate not only from the model but also from the data point of view (which is not the case in existing surveys) and focus on four typical data types (images, video, points, and events). This survey attempts to provide a systematic summary via a comprehensive survey that can serve as a valuable reference and inspire both researchers and practitioners working on visual recognition problems.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11633-022-1340-5.pdf

Reference343 articles.

1. Y. Lecun, L. Bottou, Y. Bengio, P. Haffner. Gradient-based learning applied to document recognition. Proceedings of IEEE, vol. 86, no. 11, pp. 2278–2324, 1998. DOI: https://doi.org/10.1109/5.726791.

2. G. E. Hinton, R. R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, vol. 313, no. 5786, pp. 504–507, 2006. DOI: https://doi.org/10.1126/science.1127647.

3. A. Krizhevsky, I. Sutskever, G. E. Hinton. ImageNet classification with deep convolutional neural networks. In Proceedings of the 25th International Conference on Neural Information Processing Systems, Lake Tahoe, USA, pp. 1106–1114, 2012.

4. T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, C. L. Zitnick. Microsoft COCO: Common objects in context. In Proceedings of the 13th European Conference on Computer Vision, Springer, Zurich, Switzerland, pp. 740–755, 2014. DOI: https://doi.org/10.1007/978-3-319-10602-1_48.

5. J. K. Song, Y. Y. Guo, L. L. Gao, X. L. Li, A. Hanjalic, H. T. Shen. From deterministic: to generative: Multimodal stochastic RNNs for video captioning. IEEE Transactions on Neural Networks and Learning Systems, vol. 30, no. 10, pp. 3047–3058, 2019. DOI: https://doi.org/10.1109/TNNLS.2018.2851077.

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-scale full spike pattern for semantic segmentation;Neural Networks;2024-08

2. Brain-Inspired Computing: A Systematic Survey and Future Trends;Proceedings of the IEEE;2024-06

3. Low-Complexity Vision Transformer with Adaptive Channel Partitioning Method;JOURNAL OF BROADCAST ENGINEERING;2024-05-31

4. Communication signal detection based on high-order cumulants time-frequency analysis: on the application of deep learning YOLOV5 network;Journal of Intelligent & Fuzzy Systems;2024-04-16

5. A Review of Machine Learning and Deep Learning for Object Detection, Semantic Segmentation, and Human Action Recognition in Machine and Robotic Vision;Technologies;2024-01-23