An Improved Ant Colony Algorithm with Deep Reinforcement Learning for the Robust Multiobjective AGV Routing Problem in Assembly Workshops-Reference-Cited by-同舟云学术

An Improved Ant Colony Algorithm with Deep Reinforcement Learning for the Robust Multiobjective AGV Routing Problem in Assembly Workshops

Published:2024-08-14 Issue:16 Volume:14 Page:7135
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Chen Yong¹,Chen Mingyu¹,Yu Feiyang¹,Lin Han¹,Yi Wenchao¹

Affiliation:

1. College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310014, China

Abstract

Vehicle routing problems (VRPs) are challenging problems. Many variants of the VRP have been proposed. However, few studies on VRP have combined robustness and just-in-time (JIT) requirements with uncertainty. To solve the problem, this paper proposes the just-in-time-based robust multiobjective vehicle routing problem with time windows (JIT-RMOVRPTW) for the assembly workshop. Based on the conflict between uncertain time and JIT requirements, a JIT strategy was proposed. To measure the robustness of the solution, a metric was designed as the objective. Afterwards, a two-stage nondominated sorting ant colony algorithm with deep reinforcement learning (NSACOWDRL) was proposed. In stage I, ACO combines with NSGA-III to obtain the Pareto frontier. Based on the model, a pheromone update strategy and a transfer probability formula were designed. DDQN was introduced as a local search algorithm which trains networks through Pareto solutions to participate in probabilistic selection and nondominated sorting. In stage II, the Pareto frontier was quantified in feasibility by Monte Carlo simulation, and tested by diversity-robust selection based on uniformly distributed weights in the solution space to select robust Pareto solutions that take diversity into account. The effectiveness of NSACOWDRL was demonstrated through comparative experiments with other algorithms on instances. The impact of JIT strategy is analyzed and the effect of networks on the NSACOWDRL is further discussed.

Funder

Natural Science Foundation of Zhejiang province, China

Zhejiang Provincial Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/16/7135/pdf

Reference57 articles.

1. Multiobjective Evolutionary Algorithms: A Comparative Case Study and the Strength Pareto Approach;Zitzler;IEEE Trans. Evol. Comput.,1999

2. Liu, X., Chen, Y.-L., Por, L.Y., and Ku, C.S. (2023). A Systematic Literature Review of Vehicle Routing Problems with Time Windows. Sustainability, 15.

3. An Overview and Experimental Study of Learning-Based Optimization Algorithms for the Vehicle Routing Problem;Li;IEEE/CAA J. Autom. Sin.,2022

4. Asghari, M., and Mirzapour Al-e-hashem, S.M.J. (2021). Green Vehicle Routing Problem: A State-of-the-Art Review. Int. J. Prod. Econ., 231.

5. Vehicle Routing: Review of Benchmark Datasets;Gunawan;J. Oper. Res. Soc.,2021