Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments

Published: 2020-12-23
Volume: 14
ISSN: 1662-5218
Container title: Frontiers in Neurorobotics
Short container title: Front. Neurorobot.

Author:

Jeffery Dick, Pawel Ladosz, Eseoghene Ben-Iwhiwhu, Hideyasu Shimadzu, Peter Kinnell, Praveen K. Pilly, Soheil Kolouri, Andrea Soltoggio

Abstract

The ability of an agent to detect changes in an environment is key to successful adaptation. This ability involves at least two phases: learning a model of an environment, and detecting that a change is likely to have occurred when this model is no longer accurate. This task is particularly challenging in partially observable environments, such as those modeled with partially observable Markov decision processes (POMDPs). Some predictive learners are able to infer the state from observations and thus perform better with partial observability. Predictive state representations (PSRs) and neural networks are two such tools that can be trained to predict the probabilities of future observations. However, most such existing methods focus primarily on static problems in which only one environment is learned. In this paper, we propose an algorithm that uses statistical tests to estimate the probability of different predictive models to fit the current environment. We exploit the underlying probability distributions of predictive models to provide a fast and explainable method to assess and justify the model's beliefs about the current environment. Crucially, by doing so, the method can label incoming data as fitting different models, and thus can continuously train separate models in different environments. This new method is shown to prevent catastrophic forgetting when new environments, or tasks, are encountered. The method can also be of use when AI-informed decisions require justifications because its beliefs are based on statistical evidence from observations. We empirically demonstrate the benefit of the novel method with simulations in a set of POMDP environments.
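
The idea of using a statistical test to decide which predictive model fits incoming observations can be illustrated with a small sketch. This is not the algorithm from the paper: the abstract does not say which test is used, so a Pearson chi-square goodness-of-fit test is assumed here, and each model is reduced to a fixed probability distribution over three observation symbols (a real POMDP predictor would condition on history). The names `chi_square_stat`, `label_window`, `env_A`, and `env_B` are hypothetical.

```python
from collections import Counter

# Critical value of the chi-square distribution with 2 degrees of freedom
# at significance level 0.05 (three observation symbols -> df = 3 - 1).
CHI2_CRIT_DF2 = 5.991


def chi_square_stat(observations, predicted_probs):
    """Pearson chi-square statistic of observed counts vs. a model's predictions."""
    n = len(observations)
    counts = Counter(observations)
    stat = 0.0
    for symbol, p in predicted_probs.items():
        expected = n * p
        stat += (counts.get(symbol, 0) - expected) ** 2 / expected
    return stat


def label_window(observations, models, crit=CHI2_CRIT_DF2):
    """Label a window of observations with the best-fitting model.

    Returns (name, stats), where name is None if every model is rejected,
    which is treated here as evidence of a new, unseen environment.
    """
    stats = {name: chi_square_stat(observations, probs)
             for name, probs in models.items()}
    best = min(stats, key=stats.get)
    return (best if stats[best] <= crit else None), stats


# Two hypothetical, already-learned predictive models (assumed for illustration).
models = {
    "env_A": {"x": 0.7, "y": 0.2, "z": 0.1},
    "env_B": {"x": 0.1, "y": 0.2, "z": 0.7},
}

window_a = ["x"] * 14 + ["y"] * 4 + ["z"] * 2    # matches env_A
window_b = ["x"] * 2 + ["y"] * 4 + ["z"] * 14    # matches env_B
window_new = ["x"] * 7 + ["y"] * 7 + ["z"] * 6   # matches neither model

label_a, _ = label_window(window_a, models)
label_b, _ = label_window(window_b, models)
label_new, _ = label_window(window_new, models)
```

In this toy setup a window labeled `env_A` would be used to keep training only the `env_A` model, while a `None` label would trigger the creation of a new model, which is one way the labeling step could keep previously learned models from being overwritten. The small window sizes make the chi-square approximation rough, so the thresholds should be read as illustrative only.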

Funder

Defense Advanced Research Projects Agency

Publisher

Frontiers Media SA

Subject

Artificial Intelligence, Biomedical Engineering


Cited by 3 articles.

1. A model of how hierarchical representations constructed in the hippocampus are used to navigate through space; Adaptive Behavior; 2024-08-28

2. Fault diagnosis of pressure relief valve based on improved deep Residual Shrinking Network; Measurement; 2024-01

3. A Novel Methodology Based on a Deep Neural Network and Data Mining for Predicting the Segmental Voltage Drop in Automated Guided Vehicle Battery Cells; Electronics; 2023-11-13