Reward Maximization Through Discrete Active Inference

Published:2023-04-18 Issue:5 Volume:35 Page:807-852
ISSN:0899-7667
Container-title:Neural Computation
language:en

Author:

Da Costa Lancelot¹, Sajid Noor², Parr Thomas³, Friston Karl⁴, Smith Ryan⁵

Affiliation:

1. Department of Mathematics, Imperial College London, London SW7 2AZ, U.K. l.da-costa@imperial.ac.uk

2. Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3AR, U.K. noor.sajid.18@ucl.ac.uk

3. Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3AR, U.K. thomas.parr.12@ucl.ac.uk

4. Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3AR, U.K. k.friston@ucl.ac.uk

5. Laureate Institute for Brain Research, Tulsa, OK 74136, U.S.A. rsmith@laureateinstitute.org

Abstract

Active inference is a probabilistic framework for modeling the behavior of biological and artificial agents, which derives from the principle of minimizing free energy. In recent years, this framework has been applied successfully to a variety of situations where the goal was to maximize reward, often offering comparable and sometimes superior performance to alternative approaches. In this article, we clarify the connection between reward maximization and active inference by demonstrating how and when active inference agents execute actions that are optimal for maximizing reward. Precisely, we show the conditions under which active inference produces the optimal solution to the Bellman equation, a formulation that underlies several approaches to model-based reinforcement learning and control. On partially observed Markov decision processes, the standard active inference scheme can produce Bellman optimal actions for planning horizons of 1 but not beyond. In contrast, a recently developed recursive active inference scheme (sophisticated inference) can produce Bellman optimal actions on any finite temporal horizon. We append the analysis with a discussion of the broader relationship between active inference and reinforcement learning.

Publisher

MIT Press

Subject

Cognitive Neuroscience, Arts and Humanities (miscellaneous)

Link

https://direct.mit.edu/neco/article-pdf/35/5/807/2079473/neco_a_01574.pdf

Cited by 10 articles.

1. Research and application of the flatness target curve discrete dynamic programming based on two-dimensional decision making;Expert Systems with Applications;2024-12

2. On efficient computation in active inference;Expert Systems with Applications;2024-11

3. Complex behavior from intrinsic motivation to occupy future action-state path space;Nature Communications;2024-07-29

4. Transforming Perceptions: Exploring the Multifaceted Potential of Generative AI for People with Cognitive Disabilities (Preprint);2024-07-10

5. A Probabilistic Treatment of (PO)MDPs with Multiplicative Reward Structure;2024 European Control Conference (ECC);2024-06-25