Modular production control using deep reinforcement learning: proximal policy optimization-Reference-Cited by-同舟云学术

Modular production control using deep reinforcement learning: proximal policy optimization

Published:2021-05-22 Issue:8 Volume:32 Page:2335-2351
ISSN:0956-5515
Container-title:Journal of Intelligent Manufacturing
language:en
Short-container-title:J Intell Manuf

Author:

Mayer Sebastian^ORCID,Classen Tobias,Endisch Christian

Abstract

AbstractEU regulations on

$$\textit{CO}_2$$

CO 2 limits and the trend of individualization are pushing the automotive industry towards greater flexibility and robustness in production. One approach to address these challenges is modular production, where workstations are decoupled by automated guided vehicles, requiring new control concepts. Modular production control aims at throughput-optimal coordination of products, workstations, and vehicles. For this np-hard problem, conventional control approaches lack in computing efficiency, do not find optimal solutions, or are not generalizable. In contrast, Deep Reinforcement Learning offers powerful and generalizable algorithms, able to deal with varying environments and high complexity. One of these algorithms is Proximal Policy Optimization, which is used in this article to address modular production control. Experiments in several modular production control settings demonstrate stable, reliable, optimal, and generalizable learning behavior. The agent successfully adapts its strategies with respect to the given problem configuration. We explain how to get to this learning behavior, especially focusing on the agent’s action, state, and reward design.

Funder

Audi AG

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Industrial and Manufacturing Engineering,Software

Link

https://link.springer.com/content/pdf/10.1007/s10845-021-01778-z.pdf

Reference47 articles.

1. Altenmüller T, Stüker T, Waschneck B, Kuhnle A, Lanza G (2020) Reinforcement learning for an intelligent and autonomous production control of complex job-shops under time constraints. Production Engineering 14(3), 319–328.

2. Aydin M, Öztemel E (2000) Dynamic job-shop scheduling using reinforcement learning agents. Robotics and Autonomous Systems 33(2–3), 169–178.

3. Bochmann, L. S. (2018). Entwicklung und Bewertung eines flexiblen und dezentral gesteuerten Fertigungssystems fuer variantenreiche Produkte. Ph.d. thesis, ETH Zurich.

4. Boysen, N. (2007). Produktionsplanung bei variantenfließfertigung. Springer. In K. H. Waldmann & U. M. Stocker (Eds.), Operations Research Proceedings 2006, Operations Research Proceedings (Vol. 2006, pp. 11–15). Berlin, Heidelberg: Berlin Heidelberg.

5. Chen C, Xia B, Zhou Bh, Xi L (2015) A reinforcement learning based approach for a multiple-load carrier scheduling problem. Journal of Intelligent Manufacturing 26(6), 1233–1245.

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine learning in smart production logistics: a review of technological capabilities;International Journal of Production Research;2024-07-22

2. Value Distribution DDPG With Dual-Prioritized Experience Replay for Coordinated Control of Coal-Fired Power Generation Systems;IEEE Transactions on Industrial Informatics;2024-06

3. Enhancing economic efficiency in modular production systems through deep reinforcement learning;Procedia CIRP;2024

4. Integrating Scheduling of Logistic Support Processes in Agent-Based Industry 4.0 Assembly Simulation;2023 Winter Simulation Conference (WSC);2023-12-10

5. Designing an adaptive and deep learning based control framework for modular production systems;Journal of Intelligent Manufacturing;2023-11-20