A deep reinforcement learning assisted simulated annealing algorithm for a maintenance planning problem

Published: 2022-03-15
ISSN: 0254-5330
Container-title: Annals of Operations Research
Language: en
Short-container-title: Ann Oper Res

Author:

Kosanoglu Fuat, Atmis Mahir, Turan Hasan Hüseyin

Abstract

Maintenance planning aims to improve the reliability of assets, prevent the occurrence of asset failures, and reduce maintenance costs associated with downtime of assets and maintenance resources (such as spare parts and workforce). Thus, effective maintenance planning is instrumental in ensuring high asset availability with the minimum cost. Nevertheless, to find such optimal planning is a nontrivial task due to the (i) complex and usually nonlinear inter-relationship between different planning decisions (e.g., inventory level and workforce capacity), and (ii) stochastic nature of the system (e.g., random failures of parts installed in assets). To alleviate these challenges, we study a joint maintenance planning problem by considering several decisions simultaneously, including workforce planning, workforce training, and spare parts inventory management. We develop a hybrid solution algorithm ($\mathcal{DRLSA}$) that is a combination of Double Deep Q-Network based Deep Reinforcement Learning (DRL) and Simulated Annealing (SA) algorithms. In each episode of the proposed algorithm, the best solution found by DRL is delivered to SA to be used as an initial solution, and the best solution of SA is delivered to DRL to be used as the initial state. Different from the traditional SA algorithms where neighborhood structures are selected only randomly, the DRL part of $\mathcal{DRLSA}$ learns to choose the best neighborhood structure to use based on experience gained from previous episodes. We compare the performance of the proposed solution algorithm with several well-known meta-heuristic algorithms, including Simulated Annealing, Genetic Algorithm (GA), and Variable Neighborhood Search (VNS). Further, we also develop a Machine Learning (ML) algorithm (i.e., K-Median) as another benchmark in which different properties of spare parts (e.g., failure rates, holding costs, and repair rates) are used as clustering features for the ML algorithm. Our study reveals that the $\mathcal{DRLSA}$ finds the optimal solutions for relatively small-size instances, and it has the potential to outperform traditional meta-heuristic and ML algorithms.
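The episode structure described in the abstract (a learning phase hands its best solution to SA, and SA hands its best solution back as the next starting state) can be illustrated with a minimal Python sketch. This is not the authors' implementation. To stay self-contained, it replaces the Double Deep Q-Network with a simple epsilon-greedy running value estimate over neighborhood operators, and the cost function, the `TARGET` vector, the three operators, and all parameter values are hypothetical placeholders rather than the paper's maintenance model.

```python
import math
import random

# Hypothetical toy objective (assumption): NOT the paper's maintenance cost model.
TARGET = [3, 1, 4, 1, 5]

def cost(x):
    return sum((a - b) ** 2 for a, b in zip(x, TARGET))

# Three illustrative neighborhood structures (assumptions).
def increment(x, rng):
    y = list(x)
    y[rng.randrange(len(y))] += 1
    return y

def decrement(x, rng):
    y = list(x)
    i = rng.randrange(len(y))
    y[i] = max(0, y[i] - 1)  # keep levels non-negative
    return y

def swap(x, rng):
    y = list(x)
    i, j = rng.sample(range(len(y)), 2)
    y[i], y[j] = y[j], y[i]
    return y

OPERATORS = [increment, decrement, swap]

def learning_phase(start, q, rng, steps=50, eps=0.2, alpha=0.1):
    """Stand-in for the DRL part: epsilon-greedy choice of operator,
    with q[k] tracking the average cost improvement of operator k."""
    current, best = list(start), list(start)
    for _ in range(steps):
        if rng.random() < eps:
            a = rng.randrange(len(OPERATORS))  # explore
        else:
            a = max(range(len(OPERATORS)), key=lambda k: q[k])  # exploit
        candidate = OPERATORS[a](current, rng)
        reward = cost(current) - cost(candidate)
        q[a] += alpha * (reward - q[a])
        if reward > 0:
            current = candidate
        if cost(current) < cost(best):
            best = list(current)
    return best

def annealing_phase(start, rng, steps=200, t0=5.0, cooling=0.98):
    """Plain SA with randomly chosen neighborhoods and geometric cooling."""
    current, best = list(start), list(start)
    t = t0
    for _ in range(steps):
        candidate = rng.choice(OPERATORS)(current, rng)
        delta = cost(candidate) - cost(current)
        if delta <= 0 or rng.random() < math.exp(-delta / t):
            current = candidate
        if cost(current) < cost(best):
            best = list(current)
        t *= cooling
    return best

def hybrid_search(initial, episodes=10, seed=0):
    rng = random.Random(seed)
    q = [0.0] * len(OPERATORS)  # value estimates persist across episodes
    best = list(initial)
    for _ in range(episodes):
        learned = learning_phase(best, q, rng)  # learner starts from SA's best
        best = annealing_phase(learned, rng)    # SA starts from learner's best
    return best, q

if __name__ == "__main__":
    solution, values = hybrid_search([0, 0, 0, 0, 0])
    print(solution, cost(solution), values)
```

Because both phases return the best solution they have seen, the cost of the incumbent never increases from one episode to the next. In a fuller version, the value estimates would be replaced by a network conditioned on the current state, which is what lets the choice of neighborhood depend on where the search currently is.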

Funder

University of New South Wales

Publisher

Springer Science and Business Media LLC

Subject

Management Science and Operations Research,General Decision Sciences

Link

https://link.springer.com/content/pdf/10.1007/s10479-022-04612-8.pdf

Reference: 75 articles.

1. Allen, T. T., Roychowdhury, S., & Liu, E. (2018). Reward-based Monte Carlo-Bayesian reinforcement learning for cyber preventive maintenance. Computers & Industrial Engineering, 126, 578–594.

2. Andriotis, C., & Papakonstantinou, K. (2019). Managing engineering systems with large state and action spaces through deep reinforcement learning. Reliability Engineering & System Safety, 191, 106483.

3. Andriotis, C. P., & Papakonstantinou, K. G. (2018). Managing engineering systems with large state and action spaces through deep reinforcement learning. CoRR, arXiv:1811.02052.

4. Arsenault, R. (2016). Stat of the week: The (rising!) cost of downtime. https://www.aberdeen.com/techpro-essentials/stat-of-the-week-the-rising-cost-of-downtime/. Accessed: 2021-03-07.

5. Bello, I., Pham, H., Le, Q. V., Norouzi, M., & Bengio, S. (2017). Neural combinatorial optimization with reinforcement learning. arXiv:1611.09940.

Cited by 13 articles.

1. Efficient opportunistic maintenance strategies via pruning in parallel–series systems with economic dependence;Computers & Industrial Engineering;2024-10

2. Deep reinforcement learning-based preventive maintenance for repairable machines with deterioration in a flow line system;Annals of Operations Research;2024-08-06

3. Enhanced migrating birds optimization algorithm for optimization problems in different domains;Annals of Operations Research;2024-05-21

4. The third party logistics provider freight management problem: a framework and deep reinforcement learning approach;Annals of Operations Research;2024-02-26

5. Newton-Raphson-based optimizer: A new population-based metaheuristic algorithm for continuous optimization problems;Engineering Applications of Artificial Intelligence;2024-02