Affiliation:
1. Research Institute CODE, University of the Bundeswehr Munich, Germany
2. School of Criminal Justice, University of Lausanne, Switzerland
Abstract
Digital forensics depends on data sets for various purposes like concept evaluation, educational training, and tool validation. Researchers have gathered such data sets into repositories and created data simulation frameworks for producing large amounts of data. Synthetic data often face skepticism due to its perceived deviation from real-world data, raising doubts about its realism. This paper addresses this concern, arguing that there is no definitive answer. We focus on four common digital forensic use cases that rely on data. Through these, we elucidate the specifications and prerequisites of data sets within their respective contexts. Our discourse uncovers that both real-world and synthetic data are indispensable for advancing digital forensic science, software, tools, and the competence of practitioners. Additionally, we provide an overview of available data set repositories and data generation frameworks, contributing to the ongoing dialogue on digital forensic data sets’ utility.
Keywords:
Digital forensic corpora; Data sets; Real-world data; Synthetic data; Data usage; Data synthesis, Types of data; Use cases; Realistic data; Data simulation frameworks
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Computer Science Applications,Hardware and Architecture,Safety Research,Information Systems,Software
Reference51 articles.
1. Are We Missing Labels? A Study of the Availability of Ground-Truth in Network Security Research
2. A Plea for Utilising Synthetic Data when Performing Machine Learning Based Cyber-Security Experiments
3. A research process that ensures reproducible network security research
4. Ibrahim Baggili and Frank Breitinger . 2015 . Data sources for advancing cyber forensics: what the social world has to offer . In 2015 AAAI Spring Symposium Series. Ibrahim Baggili and Frank Breitinger. 2015. Data sources for advancing cyber forensics: what the social world has to offer. In 2015 AAAI Spring Symposium Series.
5. Jon Berryhill. 2019. What is Metadata? https://www.computerforensics.com/news/what-is-metadata Jon Berryhill. 2019. What is Metadata? https://www.computerforensics.com/news/what-is-metadata
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献