Forward layer-wise learning of convolutional neural networks through separation index maximizing-Reference-Cited by-同舟云学术

Forward layer-wise learning of convolutional neural networks through separation index maximizing

Published:2024-04-13 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Karimi Ali,Kalhor Ahmad,Sadeghi Tabrizi Melika

Abstract

AbstractThis paper proposes a forward layer-wise learning algorithm for CNNs in classification problems. The algorithm utilizes the Separation Index (SI) as a supervised complexity measure to evaluate and train each layer in a forward manner. The proposed method explains that gradually increasing the SI through layers reduces the input data’s uncertainties and disturbances, achieving a better feature space representation. Hence, by approximating the SI with a variant of local triplet loss at each layer, a gradient-based learning algorithm is suggested to maximize it. Inspired by the NGRAD (Neural Gradient Representation by Activity Differences) hypothesis, the proposed algorithm operates in a forward manner without explicit error information from the last layer. The algorithm’s performance is evaluated on image classification tasks using VGG16, VGG19, AlexNet, and LeNet architectures with CIFAR-10, CIFAR-100, Raabin-WBC, and Fashion-MNIST datasets. Additionally, the experiments are applied to text classification tasks using the DBPedia and AG’s News datasets. The results demonstrate that the proposed layer-wise learning algorithm outperforms state-of-the-art methods in accuracy and time complexity.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-59176-3.pdf

Reference47 articles.

1. Werbos, P. New tools for prediction and analysis in the behavioral science. Ph. D. dissertation, Harvard University (1974).

2. Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning internal representations by error propagation (California Univ San Diego La Jolla Inst for Cognitive Science, Tech. Rep., 1985).

3. Learning-logic, D. P. Casting the cortex of the human brain in silicon. Tech. Rep., Technical Report TR-47, Center for Computational Research in Economics and (1985).

4. Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. R. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012).

5. He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778 (2016).