Learning with sparse reward in a gap junction network inspired by the insect mushroom body-Reference-Cited by-同舟云学术

Learning with sparse reward in a gap junction network inspired by the insect mushroom body

Published:2024-05-23 Issue:5 Volume:20 Page:e1012086
ISSN:1553-7358
Container-title:PLOS Computational Biology
language:en
Short-container-title:PLoS Comput Biol

Author:

Wei Tianqi,Guo Qinghai,Webb Barbara^ORCID

Abstract

Animals can learn in real-life scenarios where rewards are often only available when a goal is achieved. This ‘distal’ or ‘sparse’ reward problem remains a challenge for conventional reinforcement learning algorithms. Here we investigate an algorithm for learning in such scenarios, inspired by the possibility that axo-axonal gap junction connections, observed in neural circuits with parallel fibres such as the insect mushroom body, could form a resistive network. In such a network, an active node represents the task state, connections between nodes represent state transitions and their connection to actions, and current flow to a target state can guide decision making. Building on evidence that gap junction weights are adaptive, we propose that experience of a task can modulate the connections to form a graph encoding the task structure. We demonstrate that the approach can be used for efficient reinforcement learning under sparse rewards, and discuss whether it is plausible as an account of the insect mushroom body.

Funder

Huawei Technologies

Publisher

Public Library of Science (PLoS)

Reference73 articles.

1. Gap junctions in the brain: hardwired but functionally versatile;R Gutiérrez;The Neuroscientist,2023

2. High-frequency population oscillations are predicted to occur in hippocampal pyramidal neuronal networks interconnected by axoaxonal gap junctions;RD Traub;Neuroscience,1999

3. Electrical coupling between pyramidal cells in adult cortical regions;A Mercer;Brain cell biology,2006

4. Subthreshold somatic voltage in neocortical pyramidal cells can control whether spikes propagate from the axonal plexus to axon terminals: a model study;E Munro;Journal of Neurophysiology,2012

5. Gap junction plasticity as a mechanism to regulate network-wide oscillations;G Pernelle;PLoS computational biology,2018

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reinforcement learning as a robotics-inspired framework for insect navigation: from spatial representations to neural implementation;Frontiers in Computational Neuroscience;2024-09-09