Decomposing user-defined tasks in a reinforcement learning setup using TextWorld

Published: 2023-12-22 Volume: 10
ISSN: 2296-9144
Container-title: Frontiers in Robotics and AI
Short-container-title: Front. Robot. AI

Author:

Petsanis Thanos, Keroglou Christoforos, Kapoutsis Athanasios Ch., Kosmatopoulos Elias B., Sirakoulis Georgios Ch.

Abstract

This paper proposes a hierarchical reinforcement learning (HRL) method that decomposes a complex task into simpler sub-tasks and leverages them to improve the training of an autonomous agent in a simulated environment. For practical reasons (illustrative purposes, easy implementation, a user-friendly interface, and useful functionalities), we employ two Python frameworks, TextWorld and MiniGrid. MiniGrid functions as a 2D simulated representation of the real environment, while TextWorld functions as a high-level abstraction of this simulated environment. Training on this abstraction disentangles manipulation actions from navigation actions and allows us to design a dense reward function, instead of a sparse one, for the lower-level environment, which, as we show, improves training performance. Formal methods are utilized throughout the paper to establish that our algorithm is not prevented from deriving solutions.

Publisher

Frontiers Media SA

Subject

Artificial Intelligence, Computer Science Applications
