A review of reinforcement learning based hyper-heuristics-Reference-Cited by-同舟云学术

A review of reinforcement learning based hyper-heuristics

Published:2024-06-28 Issue: Volume:10 Page:e2141
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Li Cuixia¹,Wei Xiang¹,Wang Jing¹,Wang Shuozhe¹,Zhang Shuyan¹

Affiliation:

1. School of Cyber Science and Engineering, Zhengzhou University, Zhengzhou, Henan, China

Abstract

The reinforcement learning based hyper-heuristics (RL-HH) is a popular trend in the field of optimization. RL-HH combines the global search ability of hyper-heuristics (HH) with the learning ability of reinforcement learning (RL). This synergy allows the agent to dynamically adjust its own strategy, leading to a gradual optimization of the solution. Existing researches have shown the effectiveness of RL-HH in solving complex real-world problems. However, a comprehensive introduction and summary of the RL-HH field is still blank. This research reviews currently existing RL-HHs and presents a general framework for RL-HHs. This article categorizes the type of algorithms into two categories: value-based reinforcement learning hyper-heuristics and policy-based reinforcement learning hyper-heuristics. Typical algorithms in each category are summarized and described in detail. Finally, the shortcomings in existing researches on RL-HH and future research directions are discussed.

Funder

The National Key Technologies Research and Development Program

Key Special Technologies Research and Development Program in HenanProvince

Major Science and Technology Project in Henan Province

Key Scientific Research Project of Colleges and Universities in Henan Province

Henan Provincial Science and Technology Research Project

Publisher

PeerJ

Link

https://peerj.com/articles/cs-2141.pdf

Reference111 articles.

1. An indoor scene recognition system based on deep learning evolutionary algorithms;Afif;Soft Computing,2023

2. Perturbation based variable neighbourhood search in heuristic space for examination timetabling problem;Ahmadi,2003

3. A reinforcement learning hyper-heuristic for water distribution network optimisation;Ahmed,2021

4. An evaluation of Monte Carlo-based hyper-heuristic for interaction testing of industrial embedded software;Ahmed;Soft Computing,2020

5. Limits to learning in reinforcement learning hyper-heuristics;Alanazi,2016