Aggregation–Decomposition-Based Multi-Agent Reinforcement Learning for Multi-Reservoir Operations Optimization

Published:2020-09-25 Issue:10 Volume:12 Page:2688
ISSN:2073-4441
Container-title:Water
language:en
Short-container-title:Water

Author:

Hooshyar Milad, Mousavi S. Jamshid, Mahootchi Masoud, Ponnambalam Kumaraswamy

Abstract

Stochastic dynamic programming (SDP) is a widely used method for reservoir operations optimization under uncertainty, but it suffers from the dual curses of dimensionality and modeling. Reinforcement learning (RL), a simulation-based stochastic optimization approach, avoids the curse of modeling, which arises from the need to calculate a very large transition probability matrix. RL mitigates the curse of dimensionality but cannot eliminate it, as it remains computationally intensive in complex multi-reservoir systems. This paper presents a multi-agent RL approach combined with an aggregation/decomposition method (AD-RL) for reducing the curse of dimensionality in multi-reservoir operation optimization problems. In this model, each reservoir is managed individually by a specific operator (agent) that cooperates systematically with the other agents to find a near-optimal operating policy for the whole system. Each agent makes a decision (release) based on its current state and the feedback it receives from the states of all upstream and downstream reservoirs. The method, along with an efficient and robust artificial neural network-based procedure for tuning Q-learning parameters, has been applied to a real-world five-reservoir problem, the Parambikulam–Aliyar Project (PAP) in India. We demonstrate that the proposed AD-RL approach yields operating policies that are better than or comparable to those obtained by other stochastic optimization methods, with less computational burden.
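
To illustrate the simulation-based idea the abstract refers to, the following is a minimal sketch of tabular Q-learning for a single toy reservoir. It is not the paper's AD-RL method: there is only one agent, no aggregation/decomposition, and no neural-network parameter tuning. The function names (`train_reservoir_q`, `greedy_policy`), the storage and release discretization, the inflow distribution, the squared-deficit reward, and all parameter values are assumptions made for illustration. Inflows are sampled during simulation, so no transition probability matrix is built.

```python
import random

def train_reservoir_q(capacity=4, max_release=2, demand=2,
                      episodes=2000, horizon=12,
                      alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning for one toy reservoir (all values are illustrative)."""
    rng = random.Random(seed)
    # q[s][a]: estimated value of releasing a units when storage is s units
    q = [[0.0] * (max_release + 1) for _ in range(capacity + 1)]
    for _ in range(episodes):
        storage = rng.randint(0, capacity)  # random initial storage
        for _ in range(horizon):
            # A release cannot exceed the water currently stored
            feasible = [a for a in range(max_release + 1) if a <= storage]
            # Epsilon-greedy choice among feasible releases
            if rng.random() < epsilon:
                release = rng.choice(feasible)
            else:
                release = max(feasible, key=lambda a: q[storage][a])
            # Inflow is sampled (assumed uniform on {0, 1, 2}); spill above capacity is lost
            inflow = rng.choice([0, 1, 2])
            next_storage = min(capacity, storage - release + inflow)
            # Assumed reward: negative squared deviation of release from demand
            reward = -(demand - release) ** 2
            next_feasible = [a for a in range(max_release + 1) if a <= next_storage]
            best_next = max(q[next_storage][a] for a in next_feasible)
            # Standard Q-learning update
            q[storage][release] += alpha * (
                reward + gamma * best_next - q[storage][release]
            )
            storage = next_storage
    return q

def greedy_policy(q):
    """Best feasible release for each storage level under the learned Q-table."""
    return [
        max((a for a in range(len(row)) if a <= s), key=lambda a: row[a])
        for s, row in enumerate(q)
    ]

q_table = train_reservoir_q()
policy = greedy_policy(q_table)
```

In a multi-reservoir setting, the Q-table grows with the product of the storage discretizations of all reservoirs, which is the dimensionality issue the abstract describes. Giving each reservoir its own agent, as the paper proposes, keeps each table small.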

Publisher

MDPI AG

Subject

Water Science and Technology; Aquatic Science; Geography, Planning and Development; Biochemistry

Link

https://www.mdpi.com/2073-4441/12/10/2688/pdf

References: 54 articles.

1. Optimal operational analysis of the Colorado-Big Thompson project;Hiew,1989

2. Monte Carlo Optimization for Reservoir Operation

3. Optimization Algorithms for Large-Scale Multireservoir Hydropower Systems;Hiew,1987

4. Optimization of Value of CVP's Hydropower Production

5. Two methods for large-scale nonlinear optimization and their comparison on a case study of hydropower optimization

Cited by 6 articles.

1. Beyond engineering: A review of reservoir management through the lens of wickedness, competing objectives and uncertainty;Environmental Modelling & Software;2023-09

2. Predictive MPC-Based Operation of Urban Drainage Systems Using Input Data-Clustered Artificial Neural Networks Rainfall Forecasting Models;Hydrology;2023-06-29

3. Evolutionary algorithm-based multiobjective reservoir operation policy optimisation under uncertainty;Environmental Research Communications;2022-11-23

4. Enhancements to explicit stochastic reservoir operation optimization method;Advances in Water Resources;2022-11

5. Comparison of Two Convergence Criterion in the Optimization Process Using a Recursive Method in a Multi-Reservoir System;Water;2022-09-21