Reinforcement Learning for RoboCup Soccer Keepaway-Reference-Cited by-同舟云学术

Reinforcement Learning for RoboCup Soccer Keepaway

Published:2005-09 Issue:3 Volume:13 Page:165-188
ISSN:1059-7123
Container-title:Adaptive Behavior
language:en
Short-container-title:Adaptive Behavior

Author:

Stone Peter¹,Sutton Richard S.²,Kuhlmann Gregory¹

Affiliation:

1. Department of Computer Sciences, The University of Texas at Austin,

2. Department of Computing Science, University of Alberta,

Abstract

RoboCup simulated soccer presents many challenges to reinforcement learning methods, including a large state space, hidden and uncertain state, multiple independent agents learning simultaneously, and long and variable delays in the effects of actions. We describe our application of episodic SMDP Sarsa(λ) with linear tile-coding function approximation and variable λ to learning higher-level decisions in a keepaway subtask of RoboCup soccer. In keepaway, one team, “the keepers,” tries to keep control of the ball for as long as possible despite the efforts of “the takers.” The keepers learn individually when to hold the ball and when to pass to a teammate. Our agents learned policies that significantly outperform a range of benchmark policies. We demonstrate the generality of our approach by applying it to a number of task variations including different field sizes and different numbers of players on each team.

Publisher

SAGE Publications

Subject

Behavioral Neuroscience,Experimental and Cognitive Psychology

Link

http://journals.sagepub.com/doi/pdf/10.1177/105971230501300301

Reference50 articles.

1. Refinement of soccer agents' positions using reinforcement learning

2. Evolving Team Darwin United

Cited by 178 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Eligibility traces in an autonomous soccer robot with obstacle avoidance and navigation policy;Applied Soft Computing;2024-10

2. Obtaining the optimal shortest path between two points on a quasi-developable Bézier-type surface using the Geodesic-based Q-learning algorithm;Engineering Applications of Artificial Intelligence;2024-10

3. Automated design and optimization of distributed filter circuits using reinforcement learning;Journal of Computational Design and Engineering;2024-07-16

4. Learning agile soccer skills for a bipedal robot with deep reinforcement learning;Science Robotics;2024-04-10

5. From mimic to counteract: a two-stage reinforcement learning algorithm for Google research football;Neural Computing and Applications;2024-02-22