Affiliation:
1. Department of Information Engineering and Mathematics, University of Siena, 53100 Siena, Italy
Abstract
The current image generative models have achieved a remarkably realistic image quality, offering numerous academic and industrial applications. However, to ensure these models are used for benign purposes, it is essential to develop tools that definitively detect whether an image has been synthetically generated. Consequently, several detectors with excellent performance in computer vision applications have been developed. However, these detectors cannot be directly applied as they areto multi-spectral satellite images, necessitating the training of new models. While two-class classifiers generally achieve high detection accuracies, they struggle to generalize to image domains and generative architectures different from those encountered during training. In this paper, we propose a one-class classifier based on Vector Quantized Variational Autoencoder 2 (VQ-VAE 2) features to overcome the limitations of two-class classifiers. We start by highlighting the generalization problem faced by binary classifiers. This was demonstrated by training and testing an EfficientNet-B4 architecture on multiple multi-spectral datasets. We then illustrate that the VQ-VAE 2-based classifier, which was trained exclusively on pristine images, could detect images from different domains and generated by architectures not encountered during training. Finally, we conducted a head-to-head comparison between the two classifiers on the same generated datasets, emphasizing the superior generalization capabilities of the VQ-VAE 2-based detector, wherewe obtained a probability of detection at a 0.05 false alarm rate of 1 for the blue and red channels when using the VQ-VAE 2-based detector, and 0.72 when we used the EfficientNet-B4 classifier.
Funder
Defense Advanced Research Projects Agency
Reference45 articles.
1. Deep learning in bioinformatics;Min;Briefings Bioinform.,2017
2. Deep learning;LeCun;Nature,2015
3. A survey of the usages of deep learning for natural language processing;Otter;IEEE Trans. Neural Networks Learn. Syst.,2020
4. Multimodal and multicontrast image fusion via deep generative models;Dimitri;Inf. Fusion,2022
5. Object detection with deep learning: A review;Zhao;IEEE Trans. Neural Networks Learn. Syst.,2019
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献