HCNN: A Neural Network Model for Combining Local and Global Features Towards Human-Like Classification-Reference-Cited by-同舟云学术

HCNN: A Neural Network Model for Combining Local and Global Features Towards Human-Like Classification

Published:2015-12-30 Issue:01 Volume:30 Page:1655004
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Zhang Tielin¹,Zeng Yi¹²,Xu Bo¹²

Affiliation:

1. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, P. R. China

2. CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai 200031, P. R. China

Abstract

Brain-inspired algorithms such as convolutional neural network (CNN) have helped machine vision systems to achieve state-of-the-art performance for various tasks (e.g. image classification). However, CNNs mainly rely on local features (e.g. hierarchical features of points and angles from images), while important global structured features such as contour features are lost. Global understanding of natural objects is considered to be essential characteristics that the human visual system follows, and for developing human-like visual systems, the lost of consideration from this perspective may lead to inevitable failure on certain tasks. Experimental results have proved that well-trained CNN classifier cannot correctly distinguish fooling images (in which some local features from the natural images are chaotically distributed) from natural images. For example, a picture that is composed of yellow–black bars will be recognized as school bus with very high confidence by CNN. On the contrary, human visual system focuses on both the texture and contour features to form representation of images and would not mis-take them. In order to solve the upper problem, we propose a neural network model, named as histogram of oriented gradient (HOG) improved CNN (HCNN), that combines local and global features towards human-like classification based on CNN and HOG. The experimental results on MNIST datasets and part of ImageNet datasets show that HCNN outperforms traditional CNN for object classification with fooling images, which indicates the feasibility, accuracy and potential effectiveness of HCNN for solving image classification problem.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001416550041

Reference7 articles.

1. Learning multiple layers of representation

2. Distinctive Image Features from Scale-Invariant Keypoints

3. Human-level control through deep reinforcement learning

4. Example-based object detection in images by components

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An End-to-End Video Coding Method via Adaptive Vision Transformer;International Journal of Pattern Recognition and Artificial Intelligence;2024-01-29

2. AMFF-net: adaptive multi-modal feature fusion network for image classification;Multimedia Tools and Applications;2023-07-21

3. A multilevel recognition of Meitei Mayek handwritten characters using fusion of features strategy;The Visual Computer;2023-01-31

4. Global-first Training Strategy with Convolutional Neural Networks to Improve Scale Invariance;Communications in Computer and Information Science;2023

5. Research on neural network algorithm in artificial intelligence recognition;Sustainable Energy Technologies and Assessments;2022-10