Deep Q-Network with Reinforcement Learning for Fault Detection in Cyber-Physical Systems

Author:

Stanly Jayaprakash J.1,Priyadarsini M. Jasmine Pemeena2,Parameshachari B. D.3ORCID,Karimi Hamid Reza4,Gurumoorthy Sasikumar5

Affiliation:

1. Mahendra Institute of Technology, Namakkal 637503, Tamil Nadu, India

2. Vellore Institute of Technology, Vellore Campus, Vellore 632014, Tamil Nadu, India

3. Department of Telecommunication Engineering, GSSS Institute of Engineering & Technology for Women, Mysuru 570016, Karnataka, India

4. Department of Mechanical Engineering, Politecnico di Milano, 20156 Milan, Italy

5. Jerusalem Collège of Engineering, Chennai - 600100, Tamil Nadu, India

Abstract

Cyber-Physical System (CPS) is an integration of physical components like actuators, sensors and various types of equipment with the Internet possessing computational ability for efficient communication. A Heterogeneous Independent Network (HINT) is a realistic model that is used for the analysis of inter-dependability between the power grid and communications network. In the traditional Deep [Formula: see text]-Learning method, action needs to be stored in the [Formula: see text] table for the prediction. In real case studies, many state and action values affect the performance of the model. Existing Deep [Formula: see text]-Network (DQN) model generates all possible actions for the [Formula: see text]-values and this involves the generation of excessive information that causes the model to overfit. In this research, the Neural Network is applied to estimate the state–action in the DQN and to store the particular state–action value instead of storing all the state–action values as followed in the traditional method. The HINT model provides realistic failure propagation in the network and its state–action value overfits the existing DQN method due to the presence of more information. The proposed DQN with reinforcement learning stores selected state–action values in the [Formula: see text] tables and eliminates irrelevant information that helps to increase the accuracy and reduce the computational time. The DQN with reinforcement learning is applied to adaptively learn the system to select the optimal action in a continuous interaction with a stochastic environment. The proposed DQN model involves the application of reward function to store state–action value with higher probability based on prediction and eliminates other state–action values. Features such as intra-degree, inter-betweenness, substation-betweenness, relay-betweenness and feature vector are extracted and given as input to the DQN to characterize the critical nodes. The proposed DQN method is evaluated on the HINT network and synthetic network to analyze its efficiency in fault detection. The result shows that the HINT network has a lower prediction error compared to the existing Deep Neural Network (DNN) method. The proposed DQN and LSTM models have accuracies of 98% and 93% in fault prediction, respectively.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Electrical and Electronic Engineering,Hardware and Architecture,Electrical and Electronic Engineering,Hardware and Architecture

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A fault reconfiguration strategy based on logical structure and improved reinforcement learning for ship DC regional grid;Journal of the Franklin Institute;2024-10

2. Intelligent Detection Method of Power Grid Fault Big Data Based on Deep Learning;2023 3rd International Conference on New Energy and Power Engineering (ICNEPE);2023-11-24

3. Impulsive Accelerated Reinforcement Learning for $$H_\infty $$ Control;Neural Information Processing;2023-11-15

4. WFLTree: A Spanning Tree Construction for Federated Learning in Wireless Networks;Journal of Circuits, Systems and Computers;2023-02-23

5. An Empirical Analysis on Detection and Recognition of Intra-Cranial Hemorrhage (ICH) using 3D Computed Tomography (CT) images;2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon);2022-10-16

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3