Temporally Consistent Enhancement of Low-Light Videos via Spatial-Temporal Compatible Learning-Reference-Cited by-同舟云学术

Temporally Consistent Enhancement of Low-Light Videos via Spatial-Temporal Compatible Learning

Published:2024-05-23 Issue: Volume: Page:
ISSN:0920-5691
Container-title:International Journal of Computer Vision
language:en
Short-container-title:Int J Comput Vis

Author:

Zhu Lingyu,Yang Wenhan,Chen Baoliang,Zhu Hanwei,Meng Xiandong,Wang Shiqi^ORCID

Abstract

AbstractTemporal inconsistency is the annoying artifact that has been commonly introduced in low-light video enhancement, but current methods tend to overlook the significance of utilizing both data-centric clues and model-centric design to tackle this problem. In this context, our work makes a comprehensive exploration from the following three aspects. First, to enrich the scene diversity and motion flexibility, we construct a synthetic diverse low/normal-light paired video dataset with a carefully designed low-light simulation strategy, which can effectively complement existing real captured datasets. Second, for better temporal dependency utilization, we develop a Temporally Consistent Enhancer Network (TCE-Net) that consists of stacked 3D convolutions and 2D convolutions to exploit spatial-temporal clues in videos. Last, the temporal dynamic feature dependencies are exploited to obtain consistency constraints for different frame indexes. All these efforts are powered by a Spatial-Temporal Compatible Learning (STCL) optimization technique, which dynamically constructs specific training loss functions adaptively on different datasets. As such, multiple-frame information can be effectively utilized and different levels of information from the network can be feasibly integrated, thus expanding the synergies on different kinds of data and offering visually better results in terms of illumination distribution, color consistency, texture details, and temporal coherence. Extensive experimental results on various real-world low-light video datasets clearly demonstrate the proposed method achieves superior performance to state-of-the-art methods. Our code and synthesized low-light video database will be publicly available at https://github.com/lingyzhu0101/low-light-video-enhancement.git.

Funder

City University of Hong Kong

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11263-024-02084-w.pdf

Reference70 articles.

1. Abdullah-Al-Wadud, M., Kabir, M. H., Dewan, M. A. A., et al. (2007). A dynamic histogram equalization for image contrast enhancement. IEEE Transactions on Consumer Electronics, 53(2), 593–600.

2. Ai, S., & Kwon, J. (2020). Extreme low-light image enhancement for surveillance cameras using attention u-net. Sensors, 20(2), 495.

3. Arici, T., Dikbas, S., & Altunbasak, Y. (2009). A histogram modification framework and its application for image contrast enhancement. IEEE Transactions on Image Processing, 18(9), 1921–1935.

4. Brooks, T., Mildenhall, B., Xue, T., & et al. (2019). Unprocessing images for learned raw denoising. In Proceedings of the IEEE/CVF Conference on computer vision and pattern recognition (pp. 11036–11045)

5. Bychkovsky, V., Paris, S., Chan, E., et al. (2011). Learning photographic global tonal adjustment with a database of input/output image pairs. In CVPR 2011. IEEE (pp. 97–104).