Combining contrastive and supervised learning for video super-resolution detection

Published:2022 Issue:80 Page:1-13
ISSN:2071-2898
Container-title:Keldysh Institute Preprints
Short-container-title:KIAM Prepr.

Author:

Meshchaninov Viacheslav Pavlovich, Molodetskikh Ivan Andreevich, Vatolin Dmitriy Sergeevich, Voloboy Alexey Gennadievich

Abstract

Upscaled video detection is a helpful tool in multimedia forensics, but it is a challenging task that involves a wide range of upscaling and compression algorithms. There are many resolution-enhancement methods, including interpolation and deep-learning-based super-resolution, and they leave unique traces. This paper proposes a new upscaled-resolution-detection method based on learning visual representations with contrastive and cross-entropy losses. To explain how the method detects upscaled videos, the major components of our framework are systematically reviewed; in particular, it is shown that most data-augmentation approaches hinder the learning of the method. Extensive experiments on various datasets show that our method effectively detects upscaling even in compressed videos and outperforms the state-of-the-art alternatives. The code and models are publicly available at https://github.com/msu-video-group/SRDM.
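The abstract does not give the exact loss formulation, so the following is only a minimal sketch of how a contrastive term and a cross-entropy term might be combined into one training objective. It assumes an NT-Xent-style contrastive loss (as in the SimCLR framework) over two views of each sample; the function names, the `weight` and `temperature` parameters, and the pairing convention are hypothetical and are not taken from the paper or its repository.

```python
import math


def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, using a stabilised log-sum-exp."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def _normalize(vec):
    """Scale a vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def nt_xent(embeddings, temperature=0.5):
    """NT-Xent contrastive loss (assumed form, not confirmed by the paper).

    Assumed convention: embeddings[i] and embeddings[i + n] are two views
    of the same sample, where n is half the batch length.
    """
    total = len(embeddings)
    if total < 2 or total % 2 != 0:
        raise ValueError("expected an even number of embeddings (two views each)")
    z = [_normalize(v) for v in embeddings]
    n = total // 2
    loss = 0.0
    for i in range(total):
        pos = (i + n) % total
        # Temperature-scaled similarities to every other embedding in the batch.
        sims = {k: sum(a * b for a, b in zip(z[i], z[k])) / temperature
                for k in range(total) if k != i}
        m = max(sims.values())
        log_denominator = m + math.log(sum(math.exp(s - m) for s in sims.values()))
        loss += log_denominator - sims[pos]
    return loss / total


def combined_loss(embeddings, logits, labels, weight=1.0, temperature=0.5):
    """Contrastive term plus a weighted mean cross-entropy term (hypothetical weighting)."""
    ce = sum(cross_entropy(l, y) for l, y in zip(logits, labels)) / len(labels)
    return nt_xent(embeddings, temperature) + weight * ce
```

In this sketch the contrastive term pulls the two views of a sample together in embedding space, while the cross-entropy term trains the classifier output (for example, upscaled versus native resolution) on the labels; how the two are actually balanced in the published method is not stated in the abstract.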

Publisher

Keldysh Institute of Applied Mathematics

Subject

General Medicine

References (32 articles)

1. Topaz Gigapixel AI, 2021. https://www.topazlabs.com/gigapixel-ai.

2. Adrien Bardes, Jean Ponce, and Yann LeCun. VICReg: Variance-invariance-covariance regularization for self-supervised learning. arXiv preprint arXiv:2105.04906, 2021.

3. Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, and Lei Zhang. Toward real-world single image super-resolution: A new benchmark and a new model. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 3086-3095, 2019. https://doi.org/10.1109/iccv.2019.00318

4. Gang Cao, Antao Zhou, Xianglin Huang, Gege Song, Lifang Yang, and Yonggui Zhu. Resampling detection of recompressed images via dual-stream convolutional neural network. arXiv preprint arXiv:1901.04637, 2019.

5. Aman Chadha, John Britto, and M Mani Roja. iSeeBetter: Spatiotemporal video super-resolution using recurrent generative backprojection networks. Computational Visual Media, 6(3):307-317, 2020. https://doi.org/10.1007/s41095-020-0175-7

6. Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597-1607. PMLR, 2020.