Heterogeneous reinforcement learning for defending power grids against attacks

Author:

Moradi Mohammadamin1ORCID,Panahi Shirin1ORCID,Zhai Zheng-Meng1ORCID,Weng Yang1,Dirkman John2ORCID,Lai Ying-Cheng13ORCID

Affiliation:

1. School of Electrical, Computer and Energy Engineering, Arizona State University 1 , Tempe, Arizona 85287, USA

2. Resource Innovations 2 , 719 Main Street, Half Moon Bay, California 94019, USA

3. Department of Physics, Arizona State University 3 , Tempe, Arizona 85287, USA

Abstract

Reinforcement learning (RL) has been employed to devise the best course of actions in defending the critical infrastructures, such as power networks against cyberattacks. Nonetheless, even in the case of the smallest power grids, the action space of RL experiences exponential growth, rendering efficient exploration by the RL agent practically unattainable. The current RL algorithms tailored to power grids are generally not suited when the state-action space size becomes large, despite trade-offs. We address the large action-space problem for power grid security by exploiting temporal graph convolutional neural networks (TGCNs) to develop a parallel but heterogeneous RL framework. In particular, we divide the action space into smaller subspaces, each explored by an RL agent. How to efficiently organize the spatiotemporal action sequences then becomes a great challenge. We invoke TGCN to meet this challenge by accurately predicting the performance of each individual RL agent in the event of an attack. The top performing agent is selected, resulting in the optimal sequence of actions. First, we investigate the action-space size comparison for IEEE 5-bus and 14-bus systems. Furthermore, we use IEEE 14-bus and IEEE 118-bus systems coupled with the Grid2Op platform to illustrate the performance and action division influence on training times and grid survival rates using both deep Q-learning and Soft Actor Critic trained agents and Grid2Op default greedy agents. Our TGCN framework provides a computationally reasonable approach for generating the best course of actions to defend cyber physical systems against attacks.

Funder

Air Force Office of Scientific Research

Israel-US Binational Industrial Research and Development Foundation

Publisher

AIP Publishing

Reference54 articles.

1. The anatomy of a power grid blackout—Root causes and dynamics of recent major blackouts;IEEE Power Energy Mag.,2006

2. The 2015 Ukraine blackout: Implications for false data injection attacks;IEEE Trans. Power Syst.,2017

3. The colonial pipeline hack: Exposing vulnerabilities in us cybersecurity,2021

4. Don’t drink the cyber: Extrapolating the possibilities of Oldsmar’s water treatment cyberattack,2022

5. Curriculum learning for reinforcement learning domains: A framework and survey;J. Mach. Learn. Res.,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3