Affiliation:
1. The George Washington University, USA
Abstract
Lateral movement is a key stage of system compromise used by advanced persistent threats. Detecting it is no simple task. When network host logs are abstracted into discrete temporal graphs, the problem can be reframed as anomalous edge detection in an evolving network. Research in modern deep graph learning techniques has produced many creative and complicated models for this task. However, as is the case in many machine learning fields, the generality of models is of paramount importance for accuracy and scalability during training and inference. In this article, we propose a formalized approach to this problem with a framework we call
Euler
. It consists of a model-agnostic graph neural network stacked upon a model-agnostic sequence encoding layer such as a recurrent neural network. Models built according to the
Euler
framework can easily distribute their graph convolutional layers across multiple machines for large performance improvements. Additionally, we demonstrate that
Euler
-based models are as good, or better, than every state-of-the-art approach to anomalous link detection and prediction that we tested. As anomaly-based intrusion detection systems, our models efficiently identified anomalous connections between entities with high precision and outperformed all other unsupervised techniques for anomalous lateral movement detection. Additionally, we show that as a piece of a larger anomaly detection pipeline,
Euler
models perform well enough for use in real-world systems. With more advanced, yet still lightweight, alerting mechanisms ingesting the embeddings produced by
Euler
models, precision is boosted from 0.243, to 0.986 on real-world network traffic.
Funder
DARPA
National Science Foundation
Publisher
Association for Computing Machinery (ACM)
Subject
Safety, Risk, Reliability and Quality,General Computer Science
Reference85 articles.
1. 2023. About zeek – Book of zeek (v5.1.0). Zeek Documentation (2023). https://docs.zeek.org/en/current/about.html.
2. 2019. Distributed RPC framework. PyTorch Master Documentation (2019). https://pytorch.org/docs/master/rpc.html.
3. 2022. Pytorch/tensorpipe: A tensor-aware point-to-point communication primitive for machine learning. Pytorch/tensorpipe (2022). Retrieved from https://github.com/pytorch/tensorpipe.
4. The UCI KDD archive of large data sets for data mining research and experimentation;Bay Stephen D.;ACM SIGKDD Explorations Newsletter,2000
5. 2014. Intel xeon processor E5-2683 v3 (35M Cache 2.00 GHz) product specifications. Intel Product Specifications: Processors (2014). Retrieved from https://ark.intel.com/content/www/us/en/ark/products/81055/intel-xeon-processor-e5-2683-v3-35m-cache-2-00-ghz.html.
Cited by
17 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献