Advancing Sustainable Manufacturing: Reinforcement Learning with Adaptive Reward Machine Using an Ontology-Based Approach-Reference-Cited by-同舟云学术

Advancing Sustainable Manufacturing: Reinforcement Learning with Adaptive Reward Machine Using an Ontology-Based Approach

Published:2024-07-10 Issue:14 Volume:16 Page:5873
ISSN:2071-1050
Container-title:Sustainability
language:en
Short-container-title:Sustainability

Author:

Golpayegani Fatemeh¹,Ghanadbashi Saeedeh¹^ORCID,Zarchini Akram²

Affiliation:

1. School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland

2. Department of Computer Engineering, Sharif University of Technology, Tehran 11155-9517, Iran

Abstract

Sustainable manufacturing practices are crucial in job shop scheduling (JSS) to enhance the resilience of production systems against resource shortages and regulatory changes, contributing to long-term operational stability and environmental care. JSS involves rapidly changing conditions and unforeseen disruptions that can lead to inefficient resource use and increased waste. However, by addressing these uncertainties, we can promote more sustainable operations. Reinforcement learning-based job shop scheduler agents learn through trial and error by receiving scheduling decisions feedback in the form of a reward function (e.g., maximizing machines working time) from the environment, with their primary challenge being the handling of dynamic reward functions and navigating uncertain environments. Recently, Reward Machines (RMs) have been introduced to specify and expose reward function structures through a finite-state machine. With RMs, it is possible to define multiple reward functions for different states and switch between them dynamically. RMs can be extended to incorporate domain-specific prior knowledge, such as task-specific objectives. However, designing RMs becomes cumbersome as task complexity increases and agents must react to unforeseen events in dynamic and partially observable environments. Our proposed Ontology-based Adaptive Reward Machine (ONTOADAPT-REWARD) model addresses these challenges by dynamically creating and modifying RMs based on domain ontologies. This adaptability allows the model to outperform a state-of-the-art baseline algorithm in resource utilization, processed orders, average waiting time, and failed orders, highlighting its potential for sustainable manufacturing by optimizing resource usage and reducing idle times.

Funder

Science Foundation Ireland

Publisher

MDPI AG

Link

https://www.mdpi.com/2071-1050/16/14/5873/pdf

Reference69 articles.

1. International energy outlook 2013;Briefing;US Energy Inf. Adm.,2013

2. International Energy Agency (2022). Global Energy Review 2022, International Energy Agency.

3. A novel mathematical model and multi-objective method for the low-carbon flexible job shop scheduling problem;Yin;Sustain. Comput. Inform. Syst.,2017

4. On analysing sustainability assessment in manufacturing organisations: A survey;Eslami;Int. J. Prod. Res.,2021

5. Popper, J., Motsch, W., David, A., Petzsche, T., and Ruskowski, M. (2021, January 7–8). Utilizing multi-agent deep reinforcement learning for flexible job shop scheduling under sustainable viewpoints. Proceedings of the International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), Mauritius.