A Survey on Perceptually Optimized Video Coding

Author:

Zhang Yun1ORCID,Zhu Linwei2ORCID,Jiang Gangyi3ORCID,Kwong Sam4ORCID,Kuo C.-C. Jay5ORCID

Affiliation:

1. School of Electronics and Communication Engineering, Sun Yat-Sen University, Guangdong, China

2. Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Guangdong, China

3. Faculty of Information and Engineering, Ningbo University, Zhejiang, China

4. Department of Computer Science, City University of Hong Kong, Hong Kong, China

5. University of Southern California, California, USA

Abstract

To provide users with more realistic visual experiences, videos are developing in the trends of Ultra High Definition (UHD), High Frame Rate (HFR), High Dynamic Range (HDR), Wide Color Gammut (WCG), and high clarity. However, the data amount of videos increases exponentially, which requires high efficiency video compression for storage and network transmission. Perceptually optimized video coding aims to maximize compression efficiency by exploiting visual redundancies. In this article, we present a broad and systematic survey on perceptually optimized video coding. Firstly, we present problem formulation and framework of the perceptually optimized video coding, which includes visual perception modeling, visual quality assessment, and perceptual video coding optimization. Secondly, recent advances on visual factors, computational perceptual models, and quality assessment models are presented. Thirdly, we review perceptual video coding optimizations from four key aspects, including perceptually optimized bit allocation, rate-distortion optimization, transform and quantization, and filtering and enhancement. In each part, problem formulation, working flow, recent advances, advantages, and challenges are presented. Fourthly, perceptual coding performances of the latest coding standards and tools are experimentally analyzed. Finally, challenging issues and future opportunities are identified.

Funder

National Natural Science Foundation of China

Shenzhen Science and Technology Program

Guangdong Basic and Applied Basic Research Foundation

CAS President’s International Fellowship Initiative

Hong Kong Innovation and Technology Commission

Hong Kong GRF-RGC General Research Fund

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Cited by 9 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Towards Instrumented Fingerprinting of Urban Traffic: A Novel Methodology using Distributed Mobile Point-of-View Cameras;Proceedings of the 16th International Conference on Automotive User Interfaces and Interactive Vehicular Applications;2024-09-11

2. A Loss-initiated GAN-based Convolutional LSTM Method for Compression and Motion Estimation-Based Objective Enhancement in Images and Videos;2024-06-06

3. Perceptual Quality-Oriented Rate Allocation via Distillation from End-to-End Image Compression;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-04-25

4. Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression;IEEE Transactions on Multimedia;2024

5. A content-adaptive video compression method based on transformer;International Journal of Computational Science and Engineering;2024

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3