Frequency-Domain and Spatial-Domain MLMVN-Based Convolutional Neural Networks

Published: 2024-08-17
Journal: Algorithms
Volume: 17, Issue: 8, Page: 361
ISSN: 1999-4893
Language: English

Author:

Igor Aizenberg¹, Alexander Vasko²

Affiliation:

1. Department of Computer Science, Manhattan College, Riverdale, NY 10471, USA

2. Department of Systems Analysis and Optimization Theory, Uzhhorod National University, 88000 Uzhhorod, Ukraine

Abstract

This paper presents a detailed analysis of a convolutional neural network based on multi-valued neurons (CNNMVN) and a fully connected multilayer neural network based on multi-valued neurons (MLMVN), employed here as a convolutional neural network in the frequency domain. We begin by providing an overview of the fundamental concepts underlying CNNMVN, focusing on the organization of convolutional layers and the CNNMVN learning algorithm. The error backpropagation rule for this network is justified and presented in detail. Subsequently, we consider how MLMVN can be used as a convolutional neural network in the frequency domain. It is shown that each neuron in the first hidden layer of MLMVN may work as a frequency-domain convolutional kernel, utilizing the Convolution Theorem. Essentially, these neurons create Fourier transforms of the feature maps that would have resulted from the convolutions in the spatial domain performed in regular convolutional neural networks. Furthermore, we discuss optimization techniques for both networks and compare the resulting convolutions to explore which features they extract from images. Finally, we present experimental results showing that both approaches can achieve high accuracy in image recognition.
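The Convolution Theorem mentioned in the abstract can be illustrated with a minimal NumPy sketch. It is a generic demonstration of the theorem, not the authors' MLMVN or CNNMVN implementation: the function names `circular_conv2d` and `fft_conv2d`, the 8×8 random image, and the 3×3 random kernel are all assumptions chosen for illustration. The sketch checks that a circular convolution computed in the spatial domain matches the inverse Fourier transform of the element-wise product of the two spectra.

```python
import numpy as np


def circular_conv2d(image, kernel):
    """Circular 2-D convolution computed directly in the spatial domain.

    Hypothetical helper for illustration only. The kernel is treated as
    zero-padded to the image size, and indices wrap around the borders.
    """
    out = np.zeros(image.shape, dtype=float)
    for u in range(kernel.shape[0]):
        for v in range(kernel.shape[1]):
            # np.roll(image, (u, v))[n, m] == image[n - u, m - v] (with wrap-around),
            # so this accumulates sum_{u,v} kernel[u, v] * image[n - u, m - v].
            out += kernel[u, v] * np.roll(image, shift=(u, v), axis=(0, 1))
    return out


def fft_conv2d(image, kernel):
    """The same circular convolution, computed in the frequency domain.

    By the Convolution Theorem, the Fourier transform of the convolution is
    the element-wise product of the Fourier transforms of the inputs.
    """
    image_spectrum = np.fft.fft2(image)
    # s=image.shape zero-pads the kernel to the image size before transforming.
    kernel_spectrum = np.fft.fft2(kernel, s=image.shape)
    feature_map_spectrum = image_spectrum * kernel_spectrum
    return np.fft.ifft2(feature_map_spectrum).real


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((8, 8))   # assumed toy "image"
    kernel = rng.random((3, 3))  # assumed toy convolutional kernel
    same = np.allclose(circular_conv2d(image, kernel), fft_conv2d(image, kernel))
    print("spatial and frequency-domain results agree:", same)
```

In this sketch, `feature_map_spectrum` plays the role the abstract describes for the first hidden layer of MLMVN: it is the Fourier transform of the feature map that a spatial-domain convolution would have produced. Note that the frequency-domain product corresponds to circular convolution; matching the linear convolution used in typical spatial-domain networks would additionally require zero-padding the image.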

Publisher

MDPI AG

Link

https://www.mdpi.com/1999-4893/17/8/361/pdf
