JFB: Jacobian-Free Backpropagation for Implicit Networks

Published:2022-06-28 Issue:6 Volume:36 Page:6648-6656
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Fung Samy Wu, Heaton Howard, Li Qiuwei, McKenzie Daniel, Osher Stanley, Yin Wotao

Abstract

A promising trend in deep learning replaces traditional feedforward networks with implicit networks. Unlike traditional networks, implicit networks solve a fixed point equation to compute inferences. Solving for the fixed point varies in complexity, depending on provided data and an error tolerance. Importantly, implicit networks may be trained with fixed memory costs in stark contrast to feedforward networks, whose memory requirements scale linearly with depth. However, there is no free lunch: backpropagation through implicit networks often requires solving a costly Jacobian-based equation arising from the implicit function theorem. We propose Jacobian-Free Backpropagation (JFB), a fixed-memory approach that circumvents the need to solve Jacobian-based equations. JFB makes implicit networks faster to train and significantly easier to implement, without sacrificing test accuracy. Our experiments show implicit networks trained with JFB are competitive with feedforward networks and prior implicit networks given the same number of parameters.
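
The abstract does not spell out the equations, so the following is only a minimal sketch of the idea under stated assumptions, not the authors' implementation. It assumes a hypothetical scalar layer `T(x; w, d) = tanh(w*x + d)` (a contraction in `x` when `|w| < 1`), finds its fixed point by plain iteration, and compares two gradients of a squared-error loss with respect to `w`: one from the standard implicit-function-theorem formula, which divides by `1 - dT/dx` (the scalar analogue of solving the Jacobian-based equation), and a Jacobian-free one that, as we understand JFB, replaces that inverse term with the identity. The names `T`, `solve_fixed_point`, and `gradients`, and the values of `w`, `d`, and `y`, are illustrative assumptions.

```python
import math


def T(x, w, d):
    """Hypothetical scalar layer (an assumption for illustration).

    It is a contraction in x whenever |w| < 1, so a unique fixed point exists.
    """
    return math.tanh(w * x + d)


def solve_fixed_point(w, d, tol=1e-12, max_iter=10_000):
    """Iterate x <- T(x) until successive iterates differ by less than tol."""
    x = 0.0
    for _ in range(max_iter):
        x_next = T(x, w, d)
        if abs(x_next - x) < tol:
            return x_next
        x = x_next
    return x


def gradients(w, d, y):
    """Return (IFT gradient, Jacobian-free gradient) of
    loss = 0.5 * (x_star - y)**2 with respect to w."""
    x_star = solve_fixed_point(w, d)
    s = 1.0 - x_star ** 2      # tanh'(w*x_star + d), using x_star = tanh(w*x_star + d)
    dT_dx = w * s              # derivative of T with respect to x at the fixed point
    dT_dw = x_star * s         # derivative of T with respect to w at the fixed point
    dloss_dx = x_star - y
    # Implicit function theorem: dx*/dw = (1 - dT/dx)^(-1) * dT/dw
    grad_ift = dloss_dx * dT_dw / (1.0 - dT_dx)
    # Jacobian-free variant: the inverse term is replaced by the identity
    grad_jfb = dloss_dx * dT_dw
    return grad_ift, grad_jfb


# Illustrative values only (assumptions, not taken from the paper).
w, d, y = 0.5, 0.3, 1.0
x_star = solve_fixed_point(w, d)
grad_ift, grad_jfb = gradients(w, d, y)
```

In this scalar setting the two gradients differ only by the factor `1 - dT/dx`, which is positive whenever `|dT/dx| < 1`, so they share a sign and the Jacobian-free update still moves `w` in a descent direction. The vector-valued case in the paper needs its own assumptions, which this sketch does not attempt to reproduce.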

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 11 articles.

1. A survey of contextual optimization methods for decision-making under uncertainty;European Journal of Operational Research;2025-01

2. Lightweight and Flexible Deep Equilibrium Learning for CSI Feedback in FDD Massive MIMO;2024 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN);2024-05-05

3. Efficient and generalizable cross-patient epileptic seizure detection through a spiking neural network;Frontiers in Neuroscience;2024-01-10

4. DEQ-MPI: A Deep Equilibrium Reconstruction With Learned Consistency for Magnetic Particle Imaging;IEEE Transactions on Medical Imaging;2024-01

5. Deep Equilibrium Object Detection;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01