Synthetic dataset generator for anomaly detection in a university environment
-
Published:2023-03-15
Issue:2
Volume:27
Page:417-422
-
ISSN:1088-467X
-
Container-title:Intelligent Data Analysis
-
language:
-
Short-container-title:IDA
Author:
Strnad Pavel,Švarc Lukáš,Berka Petr
Abstract
This article introduces a recently developed synthetic dataset generator, which contains anonymised data from the Prague University of Economics and Business information system logs. The generator is opensource and is able to scale this data time-wise and also perform injection of the data with cyberattackers’ behaviour patterns. The anonymised data still contains user behaviour patterns; therefore, individual anomalous behaviour can be detected. Different types of real attack behaviour patterns in the university environment have been selected; they are used to demonstrate attackers’ behaviour in synthetically created system logs. The mentioned features allow other researchers to benchmark their anomaly detection algorithms with complex data.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Theoretical Computer Science