A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments-Reference-Cited by-同舟云学术

A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments

Published:2021-07 Issue:6 Volume:54 Page:1-25
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Padakandla Sindhu¹

Affiliation:

1. Department of Computer Science and Automation, Indian Institute of Science, Bangalore, Karnataka, India

Abstract

Reinforcement learning (RL) algorithms find applications in inventory control, recommender systems, vehicular traffic management, cloud computing, and robotics. The real-world complications arising in these domains makes them difficult to solve with the basic assumptions underlying classical RL algorithms. RL agents in these applications often need to react and adapt to changing operating conditions. A significant part of research on single-agent RL techniques focuses on developing algorithms when the underlying assumption of stationary environment model is relaxed. This article provides a survey of RL methods developed for handling dynamically varying environment models. The goal of methods not limited by the stationarity assumption is to help autonomous agents adapt to varying operating conditions. This is possible either by minimizing the rewards lost during learning by RL agent or by finding a suitable policy for the RL agent that leads to efficient operation of the underlying system. A representative collection of these algorithms is discussed in detail in this work along with their categorization and their relative merits and demerits. Additionally, we also review works that are tailored to application domains. Finally, we discuss future enhancements for this field.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3459991

Reference77 articles.

1. Addressing environment non-stationarity by repeating Q-learning updates;Abdallah Sherief;J. Mach. Learn. Res.,2016

2. Blind Hexapod Locomotion in Complex Terrain with Gait Adaptation Using Deep Reinforcement Learning and Classification

3. Fundamental Limits of Age-of-Information in Stationary and Non-stationary Environments

Cited by 72 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hebbian spatial encoder with adaptive sparse connectivity;Cognitive Systems Research;2024-12

2. An adaptable fuzzy reinforcement learning method for non-stationary environments;Neurocomputing;2024-11

3. Enhanced Safety in Autonomous Driving: Integrating a Latent State Diffusion Model for End-to-End Navigation;Sensors;2024-08-26

4. Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning;Bioinspiration & Biomimetics;2024-08-07

5. Hierarchical Reinforcement Learning-Based Routing Algorithm With Grouped RSU in Urban VANETs;IEEE Transactions on Intelligent Transportation Systems;2024-08