Exploring Quaternion Neural Network Loss Surfaces

Published:2024-04-24 Issue:3 Volume:34
ISSN:0188-7009
Container-title:Advances in Applied Clifford Algebras
Language:en
Short-container-title:Adv. Appl. Clifford Algebras

Author:

Bill Jeremiah, Cox Bruce

Abstract

This paper explores the superior performance of quaternion multi-layer perceptron (QMLP) neural networks over real-valued multi-layer perceptron (MLP) neural networks, a phenomenon that has been empirically observed but not thoroughly investigated. The study utilizes loss surface visualization and projection techniques to examine quaternion-based optimization loss surfaces for the first time. The primary contribution of this research is the statistical evidence that QMLP models yield smoother loss surfaces than real-valued neural networks, which are measured and compared using a robust quantitative measure of loss surface “goodness” based on estimates of surface curvature. Extensive computational testing validates the effectiveness of these surface curvature estimates. The paper presents a comprehensive comparison of the average surface curvature of a tuned QMLP model and a tuned real-valued MLP model on both a regression task and a classification task. The results provide strong support for the improved optimization performance observed in QMLPs across various problem domains.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00006-024-01313-2.pdf

Reference: 40 articles.

1. Abiodun, O.I., Jantan, A., Omolara, A.E., Dada, K.V., Mohamed, N.A., Arshad, H.: State-of-the-art in artificial neural network applications: A survey. Heliyon 4(11), e00938 (2018). https://doi.org/10.1016/j.heliyon.2018.e00938. http://www.sciencedirect.com/science/article/pii/S2405844018332067

2. Arena, P., Fortuna, L., Re, R., Xibilia, M.: On the capability of neural networks with complex neurons in complex valued functions approximation. In: 1993 IEEE International Symposium on Circuits and Systems, vol. 4, pp. 2168–2171 (1993). https://doi.org/10.1109/ISCAS.1993.394188

3. Arena, P., Fortuna, L., Occhipinti, L., Xibilia, M.: Neural networks for quaternion-valued function approximation. In: Proceedings of IEEE International Symposium on Circuits and Systems—ISCAS ’94, vol. 6, pp. 307–310 (1994). https://doi.org/10.1109/ISCAS.1994.409587

4. Arena, P., Baglio, S., Fortuna, L., Xibilia, M.: Chaotic time series prediction via quaternionic multilayer perceptrons. In: 1995 IEEE International Conference on Systems, Man and Cybernetics. Intelligent Systems for the 21st Century, vol. 2, pp. 1790–1794. IEEE, Vancouver (1995). https://doi.org/10.1109/ICSMC.1995.538035. http://ieeexplore.ieee.org/document/538035/

5. Arena, P., Fortuna, L., Muscato, G., Xibilia, M.G.: Multilayer perceptrons to approximate quaternion valued functions. Neural Netw. 10(2), 335–342 (1997). https://doi.org/10.1016/S0893-6080(96)00048-2. http://www.sciencedirect.com/science/article/pii/S0893608096000482