A Phase‐Change Memristive Reinforcement Learning for Rapidly Outperforming Champion Street‐Fighter Players-Reference-Cited by-同舟云学术

A Phase‐Change Memristive Reinforcement Learning for Rapidly Outperforming Champion Street‐Fighter Players

Published:2023-08-27 Issue:11 Volume:5 Page:
ISSN:2640-4567
Container-title:Advanced Intelligent Systems
language:en
Short-container-title:Advanced Intelligent Systems

Author:

Go Shao-Xiang¹,Jiang Yu¹,Loke Desmond K.¹^ORCID

Affiliation:

1. Department of Science, Mathematics and Technology and The AI Mega Centre Singapore University of Technology and Design Singapore 487372 Singapore

Abstract

The interactions with humans, and simultaneously, making of real‐time decisions in physical systems, are involved in many applications of artificial intelligence. An example of these conditions is maneuver sports. Movement‐type simulations, viz., the esports game Street Fighter (SF), recapitulate the complex multicharacter interactions and, concurrently, generate the millisecond‐level control challenges of human athletes. Herein, the physical and mental signatures of the SF agent (it is called SF R2) are controlled by utilizing a previously unreported model‐free, natural, deep reinforcement learning algorithm “Decay‐based Phase‐change memristive character‐type Proximal Policy Optimization” (DP‐PPO) through an assemblage of hybrid case‐type training processes; and an integrated training configuration for time‐trial evaluations, as well as competitions with a world's best SF player, is developed. A short length of time utilized by the SF R2 to defeat the opponent and, simultaneously, maintaining a good health level is achieved, as well as excellent handling of imperfect information settings. Training studies reveal a moderate maneuver etiquette in the SF R2, along with rapid, effective head‐to‐head competitions with one of the world's best SF player. This paves the way for achieving a broadly applicable training scheme, capable of quickly controlling complicated‐movement systems in fields where agents should observe unspecified human norms.

Funder

Ministry of Education - Singapore

Publisher

Wiley

Subject

General Medicine

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/aisy.202300335

Reference100 articles.

1. Grandmaster level in StarCraft II using multi-agent reinforcement learning

2. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

3. Outracing champion Gran Turismo drivers with deep reinforcement learning

4. Human-level control through deep reinforcement learning

5. Real-time model calibration with deep reinforcement learning

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Toward Memristive Phase‐Change Neural Network with High‐Quality Ultra‐Effective Highly‐Self‐Adjustable Online Learning;Advanced Physics Research;2024-01-23

2. Nonvolatile Memristive Materials and Physical Modeling for In‐Memory and In‐Sensor Computing;Small Science;2024-01-22