Abstract
Upscaled video detection is a helpful tool in multimedia forensics, but it’s a challenging task that involves various upscaling and compression algorithms. There are many resolution-enhancement methods, including interpolation and deep-learning based super-resolution, and they leave unique traces. This paper proposes a new upscaled-resolution-detection method based on learning of visual representations using contrastive and cross-entropy losses. To explain how the method detects videos, the major components of our framework are systematically reviewed — in particular, it is shown that most data-augmentation approaches hinder the learning of the method. Through extensive experiments on various datasets, our method has been shown to effectively detects upscaling even in compressed videos and outperforms the state-of-theart alternatives. The code and models are publicly available at https://github.com/msu-video-group/SRDM.
Publisher
Keldysh Institute of Applied Mathematics
Reference32 articles.
1. Topaz Gigapixel AI, 2021. https://www.topazlabs.com/gigapixel-ai.
2. Adrien Bardes, Jean Ponce, and Yann LeCun. Vicreg: Varianceinvariance-covariance regularization for self-supervised learning. arXiv preprint arXiv:2105.04906, 2021.
3. Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, and Lei Zhang. Toward real-world single image super-resolution: A new benchmark and a new model. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 3086-3095, 2019. https://doi.org/10.1109/iccv.2019.00318
4. Gang Cao, Antao Zhou, Xianglin Huang, Gege Song, Lifang Yang, and Yonggui Zhu. Resampling detection of recompressed images via dualstream convolutional neural network. arXiv preprint arXiv:1901.04637, 2019.
5. Aman Chadha, John Britto, and M Mani Roja. iseebetter: Spatiotemporal video super-resolution using recurrent generative backprojection networks. Computational Visual Media, 6(3):307-317, 2020. [6] Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597-1607. PMLR, 2020. https://doi.org/10.1007/s41095-020-0175-7