Affiliation:
1. School of Electrical Engineering Korea Advanced Institute of Science and Technology Daejeon Republic of Korea
2. Media Research Division Electronics and Telecommunications Research Institute Daejeon Republic of Korea
3. Space Technology Center Agency for Defense Development Daejeon Republic of Korea
Abstract
AbstractRecently, learned image compression methods based on entropy minimization have achieved superior results compared with conventional image codecs such as BPG and JPEG2000. However, they leverage single Gaussian models, which have a limited ability to approximate various irregular distributions of transformed latent representations, resulting in suboptimal coding efficiency. Furthermore, existing methods focus on constructing effective entropy models, rather than utilizing modern architectural techniques. In this paper, we propose a novel joint learning scheme called JointIQ‐Net that incorporates image compression and quality enhancement technologies with improved entropy minimization based on a newly adopted Gaussian mixture model. We also exploit global context to estimate the distributions of latent representations precisely. The results of extensive experiments demonstrate that JointIQ‐Net achieves remarkable performance improvements in terms of coding efficiency compared with existing learned image compression methods and conventional codecs. To the best of our knowledge, ours is the first learned image compression method that outperforms VVC intra‐coding in terms of both PSNR and MS‐SSIM.
Reference42 articles.
1. J.Ballé V.Laparra andE. P.Simoncelli End‐to‐end optimized image compression (5th Int. Conf. on Learning Representations Toulon France) 2017 DOI10.48550/arXiv.1611.01704.
2. J.Ballé D.Minnen S.Singh S. J.Hwang andN.Johnston Variational image compression with a scale hyperprior (6th Int. Conf. on Learning Representations Vancouver Canada) 2018 DOI10.48550/arXiv.1802.01436.
3. D.He Y.Zheng B.Sun Y.Wang andH.Qin Checkerboard context model for efficient learned image compression (Proc. IEEE/CVF Conf. Comput. Vision Pattern Recognit. Nashville TN USA) 2021 pp.14771–14780.
4. N.Johnston D.Vincent D.Minnen M.Covell S.Singh T.Chinen S.Jin Hwang J.Shor andG.Toderici Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks (IEEE Conf. Comput. Vision Pattern Recognit. (CVPR) Salt Lake City UT USA) 2018 DOI10.1109/CVPR.2018.00461.
5. J. P.Klopp Y.‐C. F.Wang S.‐Y.Chien andL.‐G.Chen Learning a code‐space predictor by exploiting intra‐image‐dependencies (British Mach. Vision Conf. Newcastle Upon Tyne UK) 2018.https://bmva‐archive.org.uk/bmvc/2018/contents/papers/0491.pdf.