SIFT: Sifting file types—application of explainable artificial intelligence in cyber forensics-Reference-Cited by-同舟云学术

SIFT: Sifting file types—application of explainable artificial intelligence in cyber forensics

Published:2024-09-11 Issue:1 Volume:7 Page:
ISSN:2523-3246
Container-title:Cybersecurity
language:en
Short-container-title:Cybersecurity

Author:

Alam Shahid^ORCID,Demir Alper Kamil

Abstract

AbstractArtificial Intelligence (AI) is being applied to improve the efficiency of software systems used in various domains, especially in the health and forensic sciences. Explainable AI (XAI) is one of the fields of AI that interprets and explains the methods used in AI. One of the techniques used in XAI to provide such interpretations is by computing the relevance of the input features to the output of an AI model. File fragment classification is one of the vital issues of file carving in Cyber Forensics (CF) and becomes challenging when the filesystem metadata is missing. Other major challenges it faces are: proliferation of file formats, file embeddings, automation, We leverage and utilize interpretations provided by XAI to optimize the classification of file fragments and propose a novel sifting approach, named SIFT (Sifting File Types). SIFT employs TF-IDF to assign weight to a byte (feature), which is used to select features from a file fragment. Threshold-based LIME and SHAP (the two XAI techniques) feature relevance values are computed for the selected features to optimize file fragment classification. To improve multinomial classification, a Multilayer Perceptron model is developed and optimized with five hidden layers, each layer with

$$i \times n$$

i × n neurons, where i = the layer number and n = the total number of classes in the dataset. When tested with 47,482 samples of 20 file types (classes), SIFT achieves a detection rate of 82.1% and outperforms the other state-of-the-art techniques by at least 10%. To the best of our knowledge, this is the first effort of applying XAI in CF for optimizing file fragment classification.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s42400-024-00241-9.pdf

Reference74 articles.

1. Adadi A, Berrada M (2018) Peeking inside the black-box: a survey on explainable artificial intelligence (XAI). IEEE Access 6:52138–52160

2. AfzaliSeresht N, Liu Q, Miao Y (2019) An explainable intelligence model for security event analysis. In: AI 2019: Advances in Artificial Intelligence: 32nd Australasian Joint Conference, Adelaide, SA, Australia, December 2–5, 2019, Proceedings 32, Springer, pp 315–327

3. Alam S (2022) Cyber Security: Past Present and Future. Lambert Academic Publishing, London, UK

4. Alam S (2023) Sift—file fragment classification without metadata. In: 3rd International Conference on Computing and Information Technology (ICCIT), IEEE, pp 123–129

5. Alam S, Altiparmak Z (2024) XAI-CF–Examining the role of explainable artificial intelligence in cyber forensics. arXiv preprint arXiv:2402:02452