Affiliation:
1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
Abstract
End-to-end deep models for video compression have advanced steadily in recent years, but the resulting pipelines are long and complex and contain many redundant parameters. Video compression approaches based on implicit neural representation (INR) instead represent a video directly as a function approximated by a neural network, which yields a more lightweight model. However, their reliance on a single feature extraction pipeline limits the network’s ability to fit the mapping function for video frames. We therefore propose a neural representation approach for video compression with an implicit multiscale fusion network (NRVC), which uses normalized residual networks to improve the effectiveness of INR in fitting the target function. We propose the multiscale representations for video compression (MSRVC) network, which extracts features from the input video sequence effectively so that the mapping function overfits the video more closely. Additionally, we propose the feature extraction channel attention (FECA) block to capture interaction information between different feature extraction channels, further improving the effectiveness of feature extraction. The results show that, at a similar bits per pixel (BPP), NRVC achieves a 2.16% higher decoded peak signal-to-noise ratio (PSNR) than the NeRV method. Moreover, NRVC outperforms the conventional HEVC in terms of PSNR.
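The two metrics quoted in the abstract, PSNR and BPP, can be illustrated with a minimal sketch. The function names below are hypothetical and the code is not taken from the paper. It assumes 8-bit pixel values, and it assumes that for an INR codec the stored model plays the role of the bitstream, so BPP is the model size in bits divided by the total number of pixels. The helper `relative_gain_percent` shows one plausible way a relative PSNR increase could be computed; the abstract does not state the exact formula used.

```python
import math


def psnr(reference, decoded, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two flat pixel sequences."""
    if len(reference) != len(decoded):
        raise ValueError("frames must contain the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((r - d) ** 2 for r, d in zip(reference, decoded)) / len(reference)
    if mse == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(max_value ** 2 / mse)


def bits_per_pixel(model_size_bits, num_frames, height, width):
    """BPP under the assumption that the stored model is the whole bitstream."""
    return model_size_bits / (num_frames * height * width)


def relative_gain_percent(baseline, improved):
    """Relative increase of `improved` over `baseline`, in percent (assumed definition)."""
    return 100.0 * (improved - baseline) / baseline
```

In this sketch a frame is passed as a flat sequence of pixel values; a real evaluation would typically average PSNR over all frames of the decoded sequence.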
Funder
National Natural Science Foundation of China
Natural Science Foundation of Jiangsu Province
Open Research Project of Zhejiang Lab
Subject
General Physics and Astronomy