Efficient Multi-Objective Optimization on Dynamic Flexible Job Shop Scheduling Using Deep Reinforcement Learning Approach

Published:2023-07-06 Issue:7 Volume:11 Page:2018
ISSN:2227-9717
Container-title:Processes
language:en
Short-container-title:Processes

Author:

Wu Zufa¹, Fan Hongbo¹², Sun Yimeng¹, Peng Manyu¹

Affiliation:

1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China

2. Faculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming 650500, China

Abstract

Previous research has focused on deep reinforcement learning (DRL) approaches to single-objective variants of the dynamic flexible job shop scheduling problem (DFJSP), with objectives such as energy consumption, earliness and tardiness penalties, and machine utilization rate. These approaches improve on metaheuristic algorithms such as the genetic algorithm (GA) and on dispatching rules such as most remaining time first (MRT) in terms of the objective metrics. However, single-objective optimization on the job shop floor cannot satisfy the requirements of modern smart manufacturing systems, and the multi-objective DFJSP has become the mainstream problem at the core of intelligent workshops. The complex production environment of a real-world factory gives scheduling entities sophisticated characteristics, e.g., non-uniform job processing times, an uncertain number of operations, due-time constraints, and the need to avoid both prolonged slack time and overload on any single machine. These characteristics motivate combining dispatching rules within DRL, so that the scheduler adapts to the manufacturing environment at different rescheduling points and accumulates maximum reward toward a global optimum. In this work, we apply a dual-layer structure of double deep Q-networks (DDQN), termed DLDDQN, to solve the DFJSP in real time with new job arrivals, optimizing two objectives simultaneously: minimizing the total delay time and the makespan. The framework comprises two layers (agents): the higher one, called the goal selector, uses a DDQN as a function approximator to select one of six proposed reward forms that embody the two optimization objectives; the lower one, called the actuator, uses a DDQN to choose the dispatching rule with the maximum Q value. Training on the generated benchmark instances converged well, and the comparative experiments validated the superiority and generality of the proposed DLDDQN.
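The two-layer decision described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the Q values are plain lists standing in for network outputs, the function names are hypothetical, and conditioning the lower layer on the chosen goal, the epsilon-greedy exploration, and the discount factor are all assumptions. Only the six reward forms, the max-Q rule choice, and the use of DDQN come from the abstract; the target formula is the standard Double DQN one.

```python
import random


def argmax(values):
    """Index of the largest value (first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.9, done=False):
    """Standard Double DQN target: the online network picks the next action,
    the target network scores it. gamma is an assumed discount factor."""
    if done:
        return reward
    best_next = argmax(next_q_online)
    return reward + gamma * next_q_target[best_next]


def epsilon_greedy(q_values, epsilon, rng):
    """Explore with probability epsilon, otherwise take the max-Q action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return argmax(q_values)


def select_goal_and_rule(goal_q, rule_q_by_goal, epsilon, rng):
    """Upper layer (goal selector) picks one of the reward forms;
    lower layer (actuator) picks a dispatching rule.
    Indexing the rule Q values by goal is an assumption of this sketch."""
    goal = epsilon_greedy(goal_q, epsilon, rng)
    rule = epsilon_greedy(rule_q_by_goal[goal], epsilon, rng)
    return goal, rule


if __name__ == "__main__":
    rng = random.Random(0)
    goal_q = [0.1, 0.9, 0.3, 0.0, 0.2, 0.4]        # six reward forms (from the abstract)
    rule_q_by_goal = [[0.3, 0.7, 0.1]] * 6         # three rules: a hypothetical count
    print(select_goal_and_rule(goal_q, rule_q_by_goal, epsilon=0.0, rng=rng))
    print(double_dqn_target(1.0, [0.2, 0.8], [0.5, 0.3], gamma=0.5))
```

With `epsilon=0.0` both layers act greedily, which corresponds to choosing the rule "that has a maximum Q value"; a nonzero epsilon would only be used during training.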

Publisher

MDPI AG

Subject

Process Chemistry and Technology,Chemical Engineering (miscellaneous),Bioengineering

Link

https://www.mdpi.com/2227-9717/11/7/2018/pdf

Reference: 42 articles.

1. A review of dynamic job shop scheduling techniques;Mohan;Procedia Manuf.,2019

2. A survey of job shop scheduling problem: The types and models;Xiong;Comput. Oper. Res.,2022

3. Zhou, H., Gu, B., and Jin, C. (2022). Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems. arXiv.

4. Zeng, Y., Liao, Z., Dai, Y., Wang, R., Li, X., and Yuan, B. (2022). Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism. arXiv.

5. A reinforcement learning approach to parameter estimation in dynamic job shop scheduling;Shahrabi;Comput. Ind. Eng.,2017

Cited by 7 articles.

1. Two-stage double deep Q-network algorithm considering external non-dominant set for multi-objective dynamic flexible job shop scheduling problems;Swarm and Evolutionary Computation;2024-10

2. Dynamic flexible job shop scheduling based on deep reinforcement learning;Proceedings of the Institution of Mechanical Engineers, Part B: Journal of Engineering Manufacture;2024-09-11

3. A discrete event simulator to implement deep reinforcement learning for the dynamic flexible job shop scheduling problem;Simulation Modelling Practice and Theory;2024-07

4. Fast Pareto set approximation for multi-objective flexible job shop scheduling via parallel preference-conditioned graph reinforcement learning;Swarm and Evolutionary Computation;2024-07

5. Multi-policy deep reinforcement learning for multi-objective multiplicity flexible job shop scheduling;Swarm and Evolutionary Computation;2024-06