Combining backpropagation with Equilibrium Propagation to improve an Actor-Critic reinforcement learning framework-Reference-Cited by-同舟云学术

Combining backpropagation with Equilibrium Propagation to improve an Actor-Critic reinforcement learning framework

Published:2022-08-23 Issue: Volume:16 Page:
ISSN:1662-5188
Container-title:Frontiers in Computational Neuroscience
language:
Short-container-title:Front. Comput. Neurosci.

Author:

Kubo Yoshimasa,Chalmers Eric,Luczak Artur

Abstract

Backpropagation (BP) has been used to train neural networks for many years, allowing them to solve a wide variety of tasks like image classification, speech recognition, and reinforcement learning tasks. But the biological plausibility of BP as a mechanism of neural learning has been questioned. Equilibrium Propagation (EP) has been proposed as a more biologically plausible alternative and achieves comparable accuracy on the CIFAR-10 image classification task. This study proposes the first EP-based reinforcement learning architecture: an Actor-Critic architecture with the actor network trained by EP. We show that this model can solve the basic control tasks often used as benchmarks for BP-based models. Interestingly, our trained model demonstrates more consistent high-reward behavior than a comparable model trained exclusively by BP.

Publisher

Frontiers Media SA

Subject

Cellular and Molecular Neuroscience,Neuroscience (miscellaneous)

Reference40 articles.

1. A learning rule for asynchronous perceptrons with feedback in a combinatorial environment;Almeida;Proceedings of the IEEE 1st International Conference on Neural Networks,1987

2. Contrastive learning and neural oscillations.;Baldi;Neural Comput.,1991

3. The arcade learning environment: An evaluation platform for general agents.;Bellemare;J. Artif. Intell. Res.,2013

4. Openai gym.;Brockman;arXiv,2016

5. Reinforcement learning with brain-inspired modulation can improve adaptation to environmental changes.;Chalmers;arXiv,2022

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A priority experience replay actor-critic algorithm using self-attention mechanism for strategy optimization of discrete problems;PeerJ Computer Science;2024-06-28

2. Biologically-inspired neuronal adaptation improves learning in neural networks;Communicative & Integrative Biology;2023-01-17

3. Low-cost electronic-nose (LC-e-nose) systems for the evaluation of plantation and fruit crops: recent advances and future trends;Analytical Methods;2023