A Discrete JAYA Algorithm Based on Reinforcement Learning and Simulated Annealing for the Traveling Salesman Problem

Published:2023-07-22 Issue:14 Volume:11 Page:3221
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Xu Jun¹, Hu Wei¹, Gu Wenjuan², Yu Yongguang³

Affiliation:

1. School of Systems Science, Beijing Jiaotong University, Beijing 100044, China

2. School of Modern Post, Beijing University of Posts and Telecommunications, Beijing 100876, China

3. School of Mathematics and Statistics, Beijing Jiaotong University, Beijing 100044, China

Abstract

The JAYA algorithm is a population-based meta-heuristic algorithm proposed in recent years that has proven suitable for global optimization and engineering optimization problems because of its simplicity, ease of implementation, and guiding principle of striving for the best solution while avoiding the worst. In this study, an improved discrete JAYA algorithm based on reinforcement learning and simulated annealing (QSA-DJAYA) is proposed to solve the well-known traveling salesman problem in combinatorial optimization. Specifically, first, the basic Q-learning algorithm from reinforcement learning is embedded into the proposed algorithm so that it can choose the most promising transformation operator for the current state when updating a solution. Second, to balance the exploration and exploitation capabilities of the QSA-DJAYA algorithm, the Metropolis acceptance criterion of the simulated annealing algorithm is introduced to decide whether to accept candidate solutions. Third, 3-opt is applied to the best solution of the current iteration at a certain frequency to improve the efficiency of the algorithm. Finally, to evaluate its performance, the QSA-DJAYA algorithm is tested on 21 benchmark datasets taken from TSPLIB and compared with other competitive algorithms in two groups of comparative experiments. The experimental results and the statistical significance tests show that the QSA-DJAYA algorithm achieves significantly better results in most instances.
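
The two central mechanisms named in the abstract, Q-learning for operator selection and Metropolis acceptance, can be illustrated with a minimal Python sketch. This is not the authors' QSA-DJAYA implementation: it runs a single-solution search and omits the JAYA population update and the 3-opt step. The two operators (`swap`, `reverse_segment`), the state definition (the index of the last operator applied), the reward (the negative change in tour length), and all parameter values are assumptions made for illustration, since the abstract does not specify them.

```python
import math
import random

def tour_length(tour, dist):
    """Total length of a closed tour given a distance matrix."""
    return sum(dist[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))

# Hypothetical transformation operators (assumed; not listed in the abstract).
def swap(tour, rng):
    i, j = rng.sample(range(len(tour)), 2)
    new = tour[:]
    new[i], new[j] = new[j], new[i]
    return new

def reverse_segment(tour, rng):
    i, j = sorted(rng.sample(range(len(tour)), 2))
    return tour[:i] + tour[i:j + 1][::-1] + tour[j + 1:]

OPERATORS = [swap, reverse_segment]

def metropolis_accept(delta, temperature, rng):
    """Always accept improvements; accept a worse tour with probability exp(-delta / T)."""
    if delta <= 0:
        return True
    return rng.random() < math.exp(-delta / temperature)

def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])

def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy choice of an operator index."""
    if rng.random() < epsilon:
        return rng.randrange(len(q[state]))
    return max(range(len(q[state])), key=lambda a: q[state][a])

def search(dist, iterations=2000, t0=10.0, cooling=0.995, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    current = list(range(len(dist)))
    rng.shuffle(current)
    current_len = tour_length(current, dist)
    best, best_len = current[:], current_len
    # Assumed state: index of the operator applied in the previous step.
    q = [[0.0] * len(OPERATORS) for _ in OPERATORS]
    state, temperature = 0, t0
    for _ in range(iterations):
        action = choose_action(q, state, epsilon, rng)
        candidate = OPERATORS[action](current, rng)
        cand_len = tour_length(candidate, dist)
        delta = cand_len - current_len
        # Assumed reward: a shorter candidate tour yields a positive reward.
        q_update(q, state, action, -delta, action)
        if metropolis_accept(delta, temperature, rng):
            current, current_len = candidate, cand_len
            if current_len < best_len:
                best, best_len = current[:], current_len
        state = action
        temperature *= cooling  # geometric cooling schedule (assumed)
    return best, best_len
```

Calling `search(dist)` with a symmetric distance matrix (a list of lists) returns the best tour found and its length. Early on, the high temperature lets the search accept worse tours and explore; as the temperature decays, acceptance becomes nearly greedy, which is the exploration and exploitation balance the abstract attributes to the Metropolis criterion.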

Funder

Fundamental Research Funds for the Central Universities

Double First-class Talent Introduction Project of China

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/14/3221/pdf
