Pathfinding in stochastic environments: learning <i>vs</i> planning-Reference-Cited by-同舟云学术

Pathfinding in stochastic environments: learning vs planning

Published:2022-08-18 Issue: Volume:8 Page:e1056
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Skrynnik Alexey¹²³,Andreychuk Anton²,Yakovlev Konstantin²³,Panov Aleksandr¹³

Affiliation:

1. Cognitive Dynamic Systems, Moscow Institute of Physics and Technology, Moscow, Russia

2. Artificial Intelligence Research Institute AIRI, Moscow, Russia

3. Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, Moscow, Russia

Abstract

Among the main challenges associated with navigating a mobile robot in complex environments are partial observability and stochasticity. This work proposes a stochastic formulation of the pathfinding problem, assuming that obstacles of arbitrary shapes may appear and disappear at random moments of time. Moreover, we consider the case when the environment is only partially observable for an agent. We study and evaluate two orthogonal approaches to tackle the problem of reaching the goal under such conditions: planning and learning. Within planning, an agent constantly re-plans and updates the path based on the history of the observations using a search-based planner. Within learning, an agent asynchronously learns to optimize a policy function using recurrent neural networks (we propose an original efficient, scalable approach). We carry on an extensive empirical evaluation of both approaches that show that the learning-based approach scales better to the increasing number of the unpredictably appearing/disappearing obstacles. At the same time, the planning-based one is preferable when the environment is close-to-the-deterministic (i.e., external disturbances are rare). Code available at https://github.com/Tviskaron/pathfinding-in-stochastic-envs.

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-1056.pdf

Reference38 articles.

1. Dota 2 with large scale deep reinforcement learning;Berner,2019

2. Simultaneous localization and mapping: a survey of current trends in autonomous driving;Bresson;IEEE Transactions on Intelligent Vehicles,2017

3. Autonomous mobile robot path planning in unknown dynamic environments using neural dynamics;Chen;Soft Computing,2020

4. Leveraging procedural generation to benchmark reinforcement learning;Cobbe,2020

5. Q-Mixing network for multi-agent pathfinding in partially observable grid environments;Davydov,2021

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. When to Switch: Planning and Learning for Partially Observable Multi-Agent Pathfinding;IEEE Transactions on Neural Networks and Learning Systems;2023

2. Grid Graph Reduction for Efficient Shortest Pathfinding;IEEE Access;2023

3. Monte-Carlo Tree Search for Multi-agent Pathfinding: Preliminary Results;Lecture Notes in Computer Science;2023

4. Planning and Learning in Multi-Agent Path Finding;Doklady Mathematics;2022-12

5. Reinforcement Learning with Success Induced Task Prioritization;Advances in Computational Intelligence;2022