Enhanced Reinforcement Learning Method Combining One-Hot Encoding-Based Vectors for CNN-Based Alternative High-Level Decisions

Published:2021-02-01 Issue:3 Volume:11 Page:1291
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Gu Bonwoo, Sung Yunsick

Abstract

Gomoku is a two-player board game that originated in ancient China. Gomoku AI has been developed with various artificial intelligence techniques, such as genetic algorithms and tree search algorithms. Alpha-Gomoku, a Gomoku AI built with Alpha-Go’s algorithm, defines all possible situations on the Gomoku board using Monte-Carlo tree search (MCTS) and minimizes the probability of learning other correct answers in duplicated Gomoku board situations. However, in the tree search algorithm, the accuracy drops because the classification criteria are set manually. In this paper, we propose an improved reinforcement learning-based high-level decision approach using convolutional neural networks (CNNs). The proposed algorithm expresses each state as One-Hot Encoding-based vectors and determines the state of the Gomoku board by combining similar states of One-Hot Encoding-based vectors. For cases where the stone position determined by the CNN is already occupied or otherwise cannot be played, we suggest a method for selecting an alternative. We verify the proposed Gomoku AI method in GuPyEngine, a Python-based 3D simulation platform.
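
The two ideas named in the abstract, one-hot encoding of board states and falling back to an alternative when the chosen cell is unavailable, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the alternative is chosen, so ranking cells by a per-cell score and taking the best empty one is an assumption, and the names `one_hot_board`, `select_move`, and the `EMPTY`/`BLACK`/`WHITE` constants are hypothetical.

```python
import numpy as np

# Hypothetical cell codes; the paper's actual encoding is not given in the abstract.
EMPTY, BLACK, WHITE = 0, 1, 2


def one_hot_board(board):
    """Encode an (N, N) integer board as an (N, N, 3) one-hot array."""
    # Row k of the 3x3 identity matrix is the one-hot vector for cell code k.
    return np.eye(3, dtype=np.float32)[board]


def select_move(scores, board):
    """Return the highest-scoring empty cell as (row, col), or None if full.

    `scores` stands in for a per-cell network output (an assumption);
    occupied cells are skipped in favor of the next-best candidate.
    """
    order = np.argsort(scores, axis=None)[::-1]  # flat indices, best first
    for flat in order:
        row, col = divmod(int(flat), board.shape[1])
        if board[row, col] == EMPTY:
            return row, col
    return None  # no empty cell left


# Usage: the top-scoring cell (1, 1) is occupied, so the next-best empty
# cell (0, 1) is selected instead.
board = np.zeros((3, 3), dtype=np.int64)
board[1, 1] = BLACK
scores = np.zeros((3, 3))
scores[1, 1] = 0.9
scores[0, 1] = 0.5
encoded = one_hot_board(board)
move = select_move(scores, board)
```

Ranking all cells once and walking down the list keeps the fallback deterministic; a real agent might instead mask occupied cells before taking the maximum.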

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/11/3/1291/pdf

References: 34 articles.

1. Design of gomoku ai based on machine game; Zheng; Comput. Knowl. Technol., 2016

2. Playing games with genetic algorithms; Marks, 2002

3. An introduction to convolutional neural networks; O’Shea; arXiv, 2015

4. A Survey of Monte Carlo Tree Search Methods

5. Using Genetic Algorithm to Solve Game of Go-Moku; Shah; IJCA Spec. Issue Optim. On-Chip Commun., 2012

Cited by 27 articles.

1. Predicting MGMT Methylation in Glioblastoma for Informed Clinical Decisions: An AI-Driven Approach in Resource-Limited Settings; 2024-07-26

2. A joint deep learning model for bearing fault diagnosis in noisy environments; Journal of Mechanical Science and Technology; 2024-07

3. Customised product design optimisation considering module synergy effects and expert preferences; International Journal of Production Research; 2024-06-04

4. An Intelligent Detection System for Wheat Appearance Quality; Agronomy; 2024-05-16

5. A new framework for deep learning video based Human Action Recognition on the edge; Expert Systems with Applications; 2024-03