Affiliation:
1. Politecnico di Milano
2. Université Libre de Bruxelles
Abstract
This article is concerned with training an agent to perform sequential behavior. In previous work, we applied reinforcement learning techniques to control a reactive agent. A purely reactive system is, of course, limited in the kinds of interaction it can learn. In particular, it can learn what we call pseudosequences—that is, sequences of actions in which each action is selected on the basis of current sensory stimuli. It cannot learn proper sequences, in which actions must also be selected on the basis of some internal state. Moreover, our results show that effective learning of proper sequences is improved by letting the agent and the trainer communicate. First, we consider trainer-to-agent communication, introducing the concept of a reinforcement sensor, which lets the learning robot explicitly know whether the last reinforcement was a reward or a punishment. We also show how the use of this sensor makes error-recovery rules emerge. Then we introduce agent-to-trainer communication, which is used to disambiguate ambiguous training situations—that is, situations in which observation of the agent's behavior does not give the trainer enough information to decide whether the agent's move is right or wrong. We also show an alternative solution to the problem of ambiguous situations, which involves learning to coordinate behavior in a simpler, unambiguous setting and then transferring what has been learned to a more complex situation. All the design choices we make are discussed and compared by means of experiments in a simulated world.
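The reinforcement-sensor idea can be illustrated with a toy sketch (the task, names, and parameters below are our own illustration, not taken from the paper): a tabular learner whose observation is augmented with one internal bit recording whether the last reinforcement was a reward or a punishment. The external observation never changes, so only this internal bit lets the agent learn a proper sequence—in particular, a recovery action to take right after a punishment.

```python
import random

def run(episodes=500, steps=20, seed=0):
    """Toy value learner with a 'reinforcement sensor' bit in its state.

    Hypothetical setup for illustration: the external observation is always
    'x'; the trainer rewards action 0 after a reward and action 1 after a
    punishment (the recovery move), and occasionally punishes regardless.
    """
    rng = random.Random(seed)
    # State = (sensor bit, external observation); two actions per state.
    q = {(1, 'x'): [0.0, 0.0], (0, 'x'): [0.0, 0.0]}
    for _ in range(episodes):
        last_ok = 1  # reinforcement sensor: 1 = last move was rewarded
        for _ in range(steps):
            qs = q[(last_ok, 'x')]
            if rng.random() < 0.1:              # epsilon-greedy exploration
                a = rng.randrange(2)
            else:
                a = 0 if qs[0] >= qs[1] else 1
            target = 0 if last_ok == 1 else 1   # recovery action after punishment
            noisy = rng.random() < 0.2          # trainer sometimes punishes anyway
            r = 1.0 if (a == target and not noisy) else -1.0
            qs[a] += 0.2 * (r - qs[a])          # one-step value update
            last_ok = 1 if r > 0 else 0         # sensor reads the sign of r
    return q
```

Without the sensor bit, the single observation 'x' is ambiguous (the correct action depends on what just happened), so no reactive policy performs well; with it, a simple error-recovery rule—"after a punishment, take action 1"—emerges in the learned values.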
Subject
Behavioral Neuroscience, Experimental and Cognitive Psychology
Cited by
27 articles.