AARF: Autonomous Attack Response Framework for Honeypots to Enhance Interaction Based on Multi-Agent Dynamic Game

Author:

Wang Le 1,2, Deng Jianyu 1, Tan Haonan 1, Xu Yinghui 1, Zhu Junyi 1, Zhang Zhiqiang 3, Li Zhaohua 4, Zhan Rufeng 1, Gu Zhaoquan 2,3

Affiliation:

1. Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou 510006, China

2. Department of New Networks, Peng Cheng Laboratory, Shenzhen 518055, China

3. School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen 518055, China

4. Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen 518000, China

Abstract

Highly interactive honeypots can form reliable connections by responding to attackers, thereby delaying and capturing intranet attacks. However, current research models the attacker as part of the environment and defines single-step attack actions by simulation to study honeypot interaction. This ignores the iterative nature of the attack-defense game and is inconsistent with the correlated, sequential nature of actions in real attacks. As a result, the honeypot response strategies generated by such studies are insufficiently interactive and cannot sustain an effective, continuous game against attack behaviors. In this paper, we propose an autonomous attack response framework (named AARF) to enhance interaction based on multi-agent dynamic games. AARF consists of three parts: a virtual honeynet environment, attack agents, and defense agents. The attack agents generate multi-step attack chains based on a Hidden Markov Model (HMM) combined with the generic threat framework ATT&CK (Adversarial Tactics, Techniques, and Common Knowledge). The defense agents iteratively interact with the attack behavior chain via reinforcement learning (RL) to learn optimal honeypot response strategies. To address the sample-utilization inefficiency of the random uniform sampling widely used in RL, we propose a dynamic value label sampling (DVLS) method for dynamic environments. DVLS effectively improves sample utilization during the experience replay phase and thus the learning efficiency of honeypot agents under the RL framework; we further couple it with a classic DQN to replace the traditional random uniform sampling method. Based on AARF, we instantiate honeypot models with different functions for deception in intranet scenarios.
In the simulation environment, the honeypots collaboratively respond to multi-step intranet attack chains to defend against these attacks, which demonstrates the effectiveness of AARF. The average cumulative reward of the DQN with DVLS is more than eight percent higher than that of a classic DQN, and its convergence speed improves by five percent.
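The abstract describes attack agents that emit multi-step attack chains from an HMM whose states are mapped to ATT&CK tactics. The paper's actual model parameters and technique mappings are not given here, so the following is only an illustrative sketch: the tactics, transition probabilities, and emitted techniques are hypothetical placeholders.

```python
import random

# Hypothetical hidden states: ATT&CK tactics along a simplified intranet kill chain.
TACTICS = ["InitialAccess", "Discovery", "LateralMovement", "Collection", "Exfiltration"]

# Row-stochastic transition probabilities between tactics (illustrative values only).
TRANS = {
    "InitialAccess":   {"Discovery": 0.7, "LateralMovement": 0.3},
    "Discovery":       {"LateralMovement": 0.6, "Collection": 0.4},
    "LateralMovement": {"Discovery": 0.2, "Collection": 0.8},
    "Collection":      {"Exfiltration": 1.0},
    "Exfiltration":    {},  # terminal tactic: no outgoing transitions
}

# Observable actions (ATT&CK technique IDs) each tactic can emit (illustrative).
EMIT = {
    "InitialAccess":   ["T1078 Valid Accounts", "T1190 Exploit Public-Facing App"],
    "Discovery":       ["T1046 Network Service Scanning", "T1083 File Discovery"],
    "LateralMovement": ["T1021 Remote Services", "T1550 Use Alternate Auth Material"],
    "Collection":      ["T1005 Data from Local System"],
    "Exfiltration":    ["T1041 Exfiltration Over C2 Channel"],
}

def sample_attack_chain(rng, start="InitialAccess", max_steps=10):
    """Sample a multi-step attack chain as a sequence of (tactic, technique) pairs."""
    chain, tactic = [], start
    for _ in range(max_steps):
        chain.append((tactic, rng.choice(EMIT[tactic])))
        nxt = TRANS[tactic]
        if not nxt:  # reached a terminal tactic
            break
        states, probs = zip(*nxt.items())
        tactic = rng.choices(states, weights=probs, k=1)[0]
    return chain

if __name__ == "__main__":
    for tactic, technique in sample_attack_chain(random.Random(0)):
        print(f"{tactic:16s} -> {technique}")
```

Chains sampled this way are correlated and sequential, which is the property the abstract argues single-step attack simulations lack.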
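The DVLS algorithm itself is not specified in the abstract, so the sketch below only illustrates the general idea it names: weighting experience-replay draws by a per-transition "value label" that is refreshed as the environment changes, instead of sampling uniformly at random. The buffer class, label update rule (absolute TD-error as a stand-in), and all parameters are assumptions for illustration.

```python
import random
from collections import deque

class ValueLabelReplayBuffer:
    """Replay buffer that samples transitions in proportion to a dynamic value label."""

    def __init__(self, capacity=10_000, eps=1e-3):
        self.buffer = deque(maxlen=capacity)   # stored transitions
        self.labels = deque(maxlen=capacity)   # one value label per transition
        self.eps = eps                         # floor so every sample stays drawable

    def push(self, transition, value_label=1.0):
        self.buffer.append(transition)
        self.labels.append(value_label)

    def sample(self, batch_size, rng):
        # Uniform sampling would be rng.sample(self.buffer, batch_size);
        # here, transitions with larger labels are replayed more often.
        weights = [l + self.eps for l in self.labels]
        idx = rng.choices(range(len(self.buffer)), weights=weights, k=batch_size)
        return idx, [self.buffer[i] for i in idx]

    def update_labels(self, indices, td_errors):
        # After a DQN gradient step, refresh the labels (e.g. with |TD-error|)
        # so the sampling distribution tracks the dynamic environment.
        for i, err in zip(indices, td_errors):
            self.labels[i] = abs(err)
```

In a DQN training loop, this buffer would simply replace the uniform replay buffer: sample a batch, compute TD-errors during the gradient step, then call `update_labels` on the sampled indices.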

Funder

Guangdong Basic and Applied Basic Research Foundation

Guangdong High-level University Foundation Program

National Natural Science Foundation of China

Publisher

MDPI AG
