Affiliations:
1. School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland
2. Department of Computer Engineering, Sharif University of Technology, Tehran 11155-9517, Iran
Abstract
Sustainable manufacturing practices are crucial in job shop scheduling (JSS): they make production systems more resilient to resource shortages and regulatory changes, and they contribute to long-term operational stability and environmental care. JSS involves rapidly changing conditions and unforeseen disruptions that can lead to inefficient resource use and increased waste; addressing these uncertainties promotes more sustainable operations. Reinforcement learning-based job shop scheduler agents learn through trial and error, receiving feedback on their scheduling decisions from the environment in the form of a reward function (e.g., maximizing machine working time). Their primary challenge is handling dynamic reward functions and navigating uncertain environments. Recently, Reward Machines (RMs) have been introduced to specify and expose the structure of reward functions through a finite-state machine. With RMs, it is possible to define a separate reward function for each state and to switch between them dynamically. RMs can also be extended to incorporate domain-specific prior knowledge, such as task-specific objectives. However, designing RMs becomes cumbersome as task complexity increases and as agents must react to unforeseen events in dynamic and partially observable environments. Our proposed Ontology-based Adaptive Reward Machine (ONTOADAPT-REWARD) model addresses these challenges by dynamically creating and modifying RMs based on domain ontologies. This adaptability allows the model to outperform a state-of-the-art baseline algorithm in resource utilization, processed orders, average waiting time, and failed orders, highlighting its potential for sustainable manufacturing by optimizing resource usage and reducing idle times.
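To make the Reward Machine idea concrete, the following is a minimal Python sketch of a finite-state machine that holds one reward function per state and switches between them on events. It is an illustration only, not the ONTOADAPT-REWARD implementation: the state names (`normal`, `degraded`), the event names, the observation keys, and the `add_state` helper are all hypothetical, and the ontology-driven step that would decide when to add a state is not modeled.

```python
from typing import Callable, Dict, Tuple

# Hypothetical observation: a dict of job-shop metrics (keys are assumptions).
Obs = Dict[str, float]
RewardFn = Callable[[Obs], float]


class RewardMachine:
    """A finite-state machine whose states each carry their own reward function."""

    def __init__(
        self,
        initial_state: str,
        transitions: Dict[Tuple[str, str], str],
        rewards: Dict[str, RewardFn],
    ) -> None:
        self.state = initial_state
        self.transitions = dict(transitions)  # (state, event) -> next state
        self.rewards = dict(rewards)          # state -> reward function

    def step(self, event: str, obs: Obs) -> float:
        """Follow the transition for `event` (if any), then score `obs`
        with the reward function of the resulting state."""
        self.state = self.transitions.get((self.state, event), self.state)
        return self.rewards[self.state](obs)

    def add_state(
        self, name: str, reward_fn: RewardFn, from_state: str, on_event: str
    ) -> None:
        """Hypothetical hook for modifying the machine at run time,
        e.g. when an unforeseen event type is first observed."""
        self.rewards[name] = reward_fn
        self.transitions[(from_state, on_event)] = name


def build_example_rm() -> RewardMachine:
    # Assumed objectives: reward utilization in normal operation,
    # penalize waiting time while a machine is down.
    return RewardMachine(
        initial_state="normal",
        transitions={
            ("normal", "machine_failure"): "degraded",
            ("degraded", "machine_repaired"): "normal",
        },
        rewards={
            "normal": lambda o: o["utilization"],
            "degraded": lambda o: -o["avg_waiting_time"],
        },
    )
```

In this sketch, an event with no matching transition leaves the machine in its current state, so the active reward function changes only when a known event fires or when a new state is registered through `add_state`.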
Funder
Science Foundation Ireland