Tensor Implementation of Monte-Carlo Tree Search for Model-Based Reinforcement Learning-Reference-Cited by-同舟云学术

Tensor Implementation of Monte-Carlo Tree Search for Model-Based Reinforcement Learning

Published:2023-01-20 Issue:3 Volume:13 Page:1406
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Baláž Marek¹^ORCID,Tarábek Peter¹^ORCID

Affiliation:

1. Faculty of Management Science and Informatics, University of Žilina, Univerzitná 8215/1, 010 26 Žilina, Slovakia

Abstract

Monte-Carlo tree search (MCTS) is a widely used heuristic search algorithm. In model-based reinforcement learning, MCTS is often utilized to improve action selection process. However, model-based reinforcement learning methods need to process large number of observations during the training. If MCTS is involved, it is necessary to run one instance of MCTS for each observation in every iteration of training. Therefore, there is a need for efficient method to process multiple instances of MCTS. We propose a MCTS implementation that can process batch of observations in fully parallel fashion on a single GPU using tensor operations. We demonstrate efficiency of the proposed approach on a MuZero reinforcement learning algorithm. Empirical results have shown that our method outperforms other approaches and scale well with increasing number of observations and simulations.

Funder

Operational Program Integrated Infrastructure

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/3/1406/pdf

Reference47 articles.

1. Mastering the game of Go with deep neural networks and tree search;Silver;Nature,2016

2. Mastering atari, go, chess and shogi by planning with a learned model;Schrittwieser;Nature,2020

3. Deep reinforcement learning for autonomous driving: A survey;Kiran;IEEE Trans. Intell. Transp. Syst.,2021

4. Azar, A.T., Koubaa, A., Ali Mohamed, N., Ibrahim, H.A., Ibrahim, Z.F., Kazim, M., Ammar, A., Benjdira, B., Khamis, A.M., and Hameed, I.A. (2021). Drone deep reinforcement learning: A review. Electronics, 10.

5. Deep reinforcement learning for drone navigation using sensor data;Hodge;Neural Comput. Appl.,2021

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Advanced Power Converters and Learning in Diverse Robotic Innovation: A Review;Energies;2023-10-19

2. Reinforcement Learning for Weighted p-median Problem;2023 International Conference on Information and Digital Technologies (IDT);2023-06-22