Abstract
Tampered multimedia content is increasingly used in a broad range of cybercrime activities. The spread of fake news and misinformation, digital kidnapping, and ransomware-related offences are among the most recurrent crimes in which manipulated digital photos and videos serve as the perpetrating and disseminating medium. Criminal investigators face challenges in applying machine learning techniques to automatically distinguish between fake and genuine seized photos and videos. Although manual validation remains necessary, easy-to-use digital forensics platforms are essential to automate and facilitate the detection of tampered content and to support criminal investigators in their work. This paper presents a machine learning method based on Support Vector Machines (SVM) to distinguish between genuine and fake multimedia files, namely digital photos and videos, which may indicate the presence of deepfake content. The method was implemented in Python and integrated as new modules in the widely used digital forensics application Autopsy. The implemented approach extracts a set of simple features by applying a Discrete Fourier Transform (DFT) to digital photos and video frames. The model was evaluated with a large dataset of labelled multimedia files containing both legitimate and fake photos and frames extracted from videos. For deepfake detection in videos, the Celeb-DFv1 dataset was used, featuring 590 original videos collected from YouTube and covering different subjects. The results obtained with 5-fold cross-validation outperformed the SVM-based methods documented in the literature, achieving average F1-scores of 99.53%, 79.55%, and 89.10% for photos, videos, and a mixture of both types of content, respectively. The proposed SVM method was also benchmarked against state-of-the-art deep learning approaches, namely Convolutional Neural Networks (CNN). Although the CNN outperformed the proposed DFT-SVM method, the competitiveness of the results attained by DFT-SVM and its substantially reduced processing time make it appropriate for embedding into Autopsy modules, which predict a level of fakeness for each analyzed multimedia file.
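The abstract does not specify which DFT-derived features are used. As an illustration only, the sketch below assumes one common choice: a radially averaged log-magnitude spectrum of a grayscale image, reduced to a fixed-length vector. The function name `dft_radial_features` and the parameter `n_bins` are hypothetical and are not taken from the paper or from Autopsy.

```python
import numpy as np


def dft_radial_features(gray, n_bins=16):
    """Return an n_bins-long radial profile of the log-magnitude DFT spectrum.

    Assumption: this radial averaging is an illustrative feature choice,
    not necessarily the feature set used by the authors.
    """
    gray = np.asarray(gray, dtype=np.float64)
    if gray.ndim != 2:
        raise ValueError("expected a 2-D grayscale array")

    # 2-D DFT with the zero-frequency term shifted to the centre.
    spectrum = np.fft.fftshift(np.fft.fft2(gray))
    log_mag = np.log1p(np.abs(spectrum))

    # Distance of every frequency bin from the centre of the spectrum.
    h, w = gray.shape
    yy, xx = np.indices((h, w))
    radius = np.hypot(yy - h // 2, xx - w // 2)

    # Map each distance to one of n_bins equally wide rings.
    scaled = radius / (radius.max() + 1e-12) * n_bins
    ring = np.minimum(scaled.astype(int), n_bins - 1)

    # Mean log-magnitude per ring (rings with no pixels stay at 0).
    sums = np.bincount(ring.ravel(), weights=log_mag.ravel(), minlength=n_bins)
    counts = np.bincount(ring.ravel(), minlength=n_bins)
    return sums / np.maximum(counts, 1)
```

Under this assumption, each photo or video frame yields one fixed-length vector, and a collection of such vectors with genuine/fake labels could then be passed to an SVM classifier and scored with 5-fold cross-validation, as the abstract describes.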
Subject
Electrical and Electronic Engineering; Computer Graphics and Computer-Aided Design; Computer Vision and Pattern Recognition; Radiology, Nuclear Medicine and Imaging
Cited by
15 articles.
1. Enhancing Autopsy with G-Code File Recovery: Ingest Module Development;2024 International Conference on Computer, Information and Telecommunication Systems (CITS);2024-07-17
2. Unmasking Frame Duplication: Comprehensive Approach to Detection in Digital Video Using Machine Learning Algorithms and Signal Processing Techniques;2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS);2024-04-18
3. Detection of Manipulated Multimedia In Digital Forensics Using Machine Learning;2024 2nd International Conference on Device Intelligence, Computing and Communication Technologies (DICCT);2024-03-15
4. Deep Learning based Model for Deepfake Image Detection: An Analytical Approach;2023 3rd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA);2023-12-21
5. A comprehensive evaluation of feature-based AI techniques for deepfake detection;Neural Computing and Applications;2023-12-14