Frequency-Domain and Spatial-Domain MLMVN-Based Convolutional Neural Networks

Author:

Aizenberg Igor1,Vasko Alexander2

Affiliation:

1. Department of Computer Science, Manhattan College, Riverdale, NY 10471, USA

2. Department of Systems Analysis and Optimization Theory, Uzhhorod National University, 88000 Uzhhorod, Ukraine

Abstract

This paper presents a detailed analysis of a convolutional neural network based on multi-valued neurons (CNNMVN) and a fully connected multilayer neural network based on multi-valued neurons (MLMVN), employed here as a convolutional neural network in the frequency domain. We begin by providing an overview of the fundamental concepts underlying CNNMVN, focusing on the organization of convolutional layers and the CNNMVN learning algorithm. The error backpropagation rule for this network is justified and presented in detail. Subsequently, we consider how MLMVN can be used as a convolutional neural network in the frequency domain. It is shown that each neuron in the first hidden layer of MLMVN may work as a frequency-domain convolutional kernel, utilizing the Convolution Theorem. Essentially, these neurons create Fourier transforms of the feature maps that would have resulted from the convolutions in the spatial domain performed in regular convolutional neural networks. Furthermore, we discuss optimization techniques for both networks and compare the resulting convolutions to explore which features they extract from images. Finally, we present experimental results showing that both approaches can achieve high accuracy in image recognition.

Publisher

MDPI AG

Reference82 articles.

1. Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3–6). ImageNet Classification with Deep Convolutional Neural Networks. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA.

2. Learning Methods for Generic Object Recognition with Invariance to Pose and Lighting;LeCun;Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004,2004

3. Jarrett, K., Kavukcuoglu, K., Ranzato, M.A., and LeCun, Y. (October, January 27). What Is the Best Multi-Stage Architecture for Object Recognition?. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan.

4. Text Recognition and Machine Learning: For Impaired Robots and Humans;Gifford;Alta. Acad. Rev.,2019

5. A Text Emotion Analysis Method Using the Dual-Channel Convolution Neural Network in Social Networks;Wu;Math. Probl. Eng.,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3