A novel method-based reinforcement learning with deep temporal difference network for flexible double shop scheduling problem-Reference-Cited by-同舟云学术

A novel method-based reinforcement learning with deep temporal difference network for flexible double shop scheduling problem

Published:2024-04-20 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Wang Xiao,Zhong Peisi,Liu Mei,Zhang Chao,Yang Shihao

Abstract

AbstractThis paper studies the flexible double shop scheduling problem (FDSSP) that considers simultaneously job shop and assembly shop. It brings about the problem of scheduling association of the related tasks. To this end, a reinforcement learning algorithm with a deep temporal difference network is proposed to minimize the makespan. Firstly, the FDSSP is defined as the mathematical model of the flexible job-shop scheduling problem joined to the assembly constraint level. It is translated into a Markov decision process that directly selects behavioral strategies according to historical machining state data. Secondly, the proposed ten generic state features are input into the deep neural network model to fit the state value function. Similarly, eight simple constructive heuristics are used as candidate actions for scheduling decisions. From the greedy mechanism, optimally combined actions of all machines are obtained for each decision step. Finally, a deep temporal difference reinforcement learning framework is established, and a large number of comparative experiments are designed to analyze the basic performance of this algorithm. The results showed that the proposed algorithm was better than most other methods, which contributed to solving the practical production problem of the manufacturing industry.

Funder

the Natural Science Foundation of Shandong Province

the National Natural Science Foundation of China

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-59414-8.pdf

Reference72 articles.

1. Friederich, J. & Lazarova-Molnar, S. Reliability assessment of manufacturing systems: A comprehensive overview, challenges and opportunities. J. Manuf. Syst. 72, 38–58. https://doi.org/10.1016/j.jmsy.2023.11.001 (2024).

2. Xu, Y. et al. Hybrid quantum particle swarm optimization and variable neighborhood search for flexible job-shop scheduling problem. J. Manuf. Syst. 73, 334–348. https://doi.org/10.1016/j.jmsy.2024.02.007 (2024).

3. Fernandes, J. M. R. C., Homayouni, S. M. & Fontes, D. B. M. M. Energy-efficient scheduling in job shop manufacturing systems: A literature review. Sustainability 14, 6264. https://doi.org/10.3390/su14106264 (2022).

4. Lu, H. L., Huang, G. Q. & Yang, H. D. Integrating order review/release and dispatching rules for assembly job shop scheduling using a simulation approach. Int. J. Prod. Res. 49, 647–669. https://doi.org/10.1080/00207540903524490 (2011).

5. Thuerer, M. et al. The application of workload control in assembly job shops: An assessment by simulation. Int. J. Prod. Res. 50, 5048–5062. https://doi.org/10.1080/00207543.2011.631600 (2012).