Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network-Reference-Cited by-同舟云学术

Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network

Published:2010 Issue: Volume:2010 Page:1-9
ISSN:1687-9600
Container-title:Journal of Robotics
language:en
Short-container-title:Journal of Robotics

Author:

Goto Kenta¹,Shibata Katsunari¹

Affiliation:

1. Department of Electrical and Electronic Engineering, Oita University, 700 Dannoharu, Oita 870-1192, Japan

Abstract

To develop a robot that behaves flexibly in the real world, it is essential that it learns various necessary functions autonomously without receiving significant information from a human in advance. Among such functions, this paper focuses on learning “prediction” that is attracting attention recently from the viewpoint of autonomous learning. The authors point out that it is important to acquire through learning not only the way of predicting future information, but also the purposive extraction of prediction target from sensor signals. It is suggested that through reinforcement learning using a recurrent neural network, both emerge purposively and simultaneously without testing individually whether or not each piece of information is predictable. In a task where an agent gets a reward when it catches a moving object that can possibly become invisible, it was observed that the agent learned to detect the necessary factors of the object velocity before it disappeared, to relay the information among some hidden neurons, and finally to catch the object at an appropriate position and timing, considering the effects of bounces off a wall after the object became invisible.

Funder

Japan Society for the Promotion of Science

Publisher

Hindawi Limited

Subject

General Computer Science,Control and Systems Engineering

Link

http://downloads.hindawi.com/journals/jr/2010/437654.pdf

Reference13 articles.

1. Learning to generate articulated behavior through the bottom-up and the top-down interaction processes

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on a Fast Human-Detection Algorithm for Unmanned Surveillance Area in Bulk Ports;Mathematical Problems in Engineering;2014

2. Improved Stability Criteria of Static Recurrent Neural Networks with a Time-Varying Delay;The Scientific World Journal;2014

3. Acquisition of Context-Based Active Word Recognition by Q-Learning Using a Recurrent Neural Network;Robot Intelligence Technology and Applications 2;2014

4. Performance comparison of non-RNN and RNN in Emergence of Discrete Decision Making through Reinforcement Learning.;FRONT ARTIF INTEL AP;2012