A Bidirectional Feedforward Neural Network Architecture Using the Discretized Neural Memory Ordinary Differential Equation

Published: 2024-02-06 Issue: 04 Volume: 34
ISSN: 0129-0657
Container-title: International Journal of Neural Systems
Language: en
Short-container-title: Int. J. Neur. Syst.

Author:

Niu Hao¹, Yi Zhang¹, He Tao¹

Affiliation:

1. College of Computer Science, Sichuan University, Chengdu 610065, P. R. China

Abstract

Deep Feedforward Neural Networks (FNNs) with skip connections have revolutionized various image recognition tasks. In this paper, we propose a novel architecture called bidirectional FNN (BiFNN), which utilizes skip connections to aggregate features between its forward and backward paths. The BiFNN accepts any general FNN model as a plugin in its forward path, introducing only a few additional parameters in the cross-path connections. The backward path is implemented as a nonparameter layer, utilizing a discretized form of the neural memory Ordinary Differential Equation (nmODE), which is named [Formula: see text]-net. We provide a proof of convergence for the [Formula: see text]-net and evaluate its initial value problem. Our proposed architecture is evaluated on diverse image recognition datasets, including Fashion-MNIST, SVHN, CIFAR-10, CIFAR-100, and Tiny-ImageNet. The results demonstrate that BiFNNs offer significant improvements compared to embedded models such as ConvMixer, ResNet, ResNeXt, and Vision Transformer. Furthermore, BiFNNs can be fine-tuned to achieve comparable performance with embedded models on Tiny-ImageNet and ImageNet-1K datasets by loading the same pretrained parameters.

Publisher

World Scientific Pub Co Pte Ltd

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0129065724500151

References: 65 articles.

1. Deep Residual Learning for Image Recognition

2. Densely Connected Convolutional Networks

3. Identity Mappings in Deep Residual Networks

4. Aggregated Residual Transformations for Deep Neural Networks

Cited by 2 articles.

1. Precise Localization for Anatomo-Physiological Hallmarks of the Cervical Spine by Using Neural Memory Ordinary Differential Equation;International Journal of Neural Systems;2024-07-25

2. A Forward Learning Algorithm for Neural Memory Ordinary Differential Equations;International Journal of Neural Systems;2024-06-21