Hardening Active Directory Graphs via Evolutionary Diversity Optimization based Policies

Published: 2024-08-12
ISSN: 2688-3007
Container-title: ACM Transactions on Evolutionary Learning and Optimization
Language: en
Short-container-title: ACM Trans. Evol. Learn. Optim.

Author:

Goel Diksha¹, Ward Max², Neumann Aneta³, Neumann Frank³, Nguyen Hung³, Guo Mingyu³

Affiliation:

1. CSIRO’s Data61, Australia

2. Department of Computer Science and Software Engineering, University of Western Australia, Australia

3. School of Computer and Mathematical Sciences, University of Adelaide, Australia

Abstract

Active Directory (AD) is the default security management system for Windows domain networks. An AD environment can be described as a cyber-attack graph, with nodes representing computers, accounts, etc., and edges indicating existing accesses or known exploits that enable attackers to move from one node to another. This paper explores a Stackelberg game model between one attacker and one defender on an AD attack graph. The attacker's goal is to maximize their chance of reaching the destination before being detected. The defender's aim is to block a constant number of edges to minimize the attacker's chance of success. The paper shows that the problem is #P-hard and, therefore, intractable to solve exactly. To defend the AD graph from cyber attackers, this paper proposes two defensive approaches. In the first approach, we convert the attacker's problem to an exponential-sized dynamic program that is approximated by a neural network (NN). Once trained, the NN serves as an efficient fitness function for the defender's Evolutionary Diversity Optimization based defensive policy. The diversity emphasis on the defender's solution provides a diverse set of training samples, improving the training accuracy of our NN for modeling the attacker. In the second approach, we propose a reinforcement learning (RL) based policy to solve the attacker's problem and a critic-network-assisted Evolutionary Diversity Optimization based defensive policy to solve the defender's problem. Experimental results on synthetic AD graphs show that the proposed defensive policies are scalable, highly effective, approximate the attacker's problem accurately, and generate good defensive plans.
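The defender-attacker interaction described in the abstract can be illustrated with a deliberately simplified sketch. It is not the paper's model or method: it assumes each edge carries an independent probability of being traversed undetected, that the attacker commits to a single best path, and that the defender brute-forces every set of blocked edges. The abstract's actual problem is #P-hard and is tackled with NN/RL approximations and Evolutionary Diversity Optimization. The graph, node names, probabilities, and function names below are all hypothetical.

```python
from itertools import combinations
import heapq
import math


def best_attack_probability(edges, source, target, blocked=frozenset()):
    """Highest product of per-edge success probabilities over any
    source -> target path, ignoring blocked edges (simplified attacker)."""
    # Maximizing a product of probabilities equals minimizing the sum of
    # -log(p), so Dijkstra applies because every weight is non-negative.
    adjacency = {}
    for (u, v), p in edges.items():
        if (u, v) in blocked or p <= 0:
            continue
        adjacency.setdefault(u, []).append((v, -math.log(p)))

    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, math.inf):
            continue  # stale queue entry
        if u == target:
            return math.exp(-d)
        for v, w in adjacency.get(u, []):
            nd = d + w
            if nd < dist.get(v, math.inf):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return 0.0  # target unreachable


def best_defense(edges, source, target, budget):
    """Brute-force defender: try every set of `budget` edges to block and
    keep the one that minimizes the attacker's best success probability."""
    best_plan = ()
    best_value = best_attack_probability(edges, source, target)
    for plan in combinations(sorted(edges), budget):
        value = best_attack_probability(edges, source, target, frozenset(plan))
        if value < best_value:
            best_plan, best_value = plan, value
    return best_plan, best_value


# Hypothetical toy attack graph: (from, to) -> probability of undetected traversal.
edges = {
    ("entry", "a"): 0.9,
    ("a", "da"): 0.9,
    ("entry", "b"): 0.5,
    ("b", "da"): 0.8,
    ("a", "b"): 0.9,
}

if __name__ == "__main__":
    print(best_attack_probability(edges, "entry", "da"))
    print(best_defense(edges, "entry", "da", budget=1))
```

The brute-force loop is only feasible for tiny graphs and budgets, which is the motivation the abstract gives for replacing exact evaluation with a learned fitness function inside an evolutionary search.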

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3688401
