An Efficient Recurrent Adversarial Framework for Unsupervised Real-Time Video Enhancement-Reference-Cited by-同舟云学术

An Efficient Recurrent Adversarial Framework for Unsupervised Real-Time Video Enhancement

Published:2023-01-08 Issue:4 Volume:131 Page:1042-1059
ISSN:0920-5691
Container-title:International Journal of Computer Vision
language:en
Short-container-title:Int J Comput Vis

Author:

Fuoli Dario^ORCID,Huang Zhiwu,Paudel Danda Pani,Van Gool Luc,Timofte Radu

Abstract

AbstractVideo enhancement is a challenging problem, more than that of stills, mainly due to high computational cost, larger data volumes and the difficulty of achieving consistency in the spatio-temporal domain. In practice, these challenges are often coupled with the lack of example pairs, which inhibits the application of supervised learning strategies. To address these challenges, we propose an efficient adversarial video enhancement framework that learns directly from unpaired video examples. In particular, our framework introduces new recurrent cells that consist of interleaved local and global modules for implicit integration of spatial and temporal information. The proposed design allows our recurrent cells to efficiently propagate spatio-temporal information across frames and reduces the need for high complexity networks. Our setting enables learning from unpaired videos in a cyclic adversarial manner, where the proposed recurrent units are employed in all architectures. Efficient training is accomplished by introducing one single discriminator that learns the joint distribution of source and target domain simultaneously. The enhancement results demonstrate clear superiority of the proposed video enhancer over the state-of-the-art methods, in all terms of visual quality, quantitative metrics, and inference speed. Notably, our video enhancer is capable of enhancing over 35 frames per second of FullHD video (1080x1920).

Funder

Swiss Federal Institute of Technology Zurich

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://link.springer.com/content/pdf/10.1007/s11263-022-01735-0.pdf

Reference61 articles.

1. Aittala, M., & Durand, F. (2018). Burst image deblurring using permutation invariant convolutional neural networks. In Proceedings of the European conference on computer vision (ECCV) (pp. 731–747).

2. Baker, S., Scharstein, D., Lewis, J. P., Roth, S., Black, M. J., & Szeliski, R. (2011). A database and evaluation methodology for optical flow. International Journal of Computer Vision, 92(1), 1–31.

3. Bansal, A., Ma, S., Ramanan, D., & Yaser, S. (2018). Recycle-gan: Unsupervised video retargeting. In ECCV.

4. Chen, Y., Pan, Y., Yao, T., Tian, X., & Mei, T. (2019). Mocycle-gan: Unpaired video-to-video translation. In Proceedings of the 27th ACM international conference on multimedia, MM ’19, 647–655, New York, NY, USA. Association for Computing Machinery.

5. Chen, Y.-S., Wang, Y.-C., Kao, M.-H., & Chuang, Y.-Y. (2018). Deep photo enhancer: Unpaired learning for image enhancement from photographs with gans. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6306–6314).

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Stable Viewport-Based Unsupervised Compressed 360° Video Quality Enhancement;IEEE Transactions on Broadcasting;2024-06

2. A novel method for video enhancement under low light using BFR-SEQT technique;The Imaging Science Journal;2024-02-13