A Hybrid Contrast and Texture Masking Model to Boost High Efficiency Video Coding Perceptual Rate-Distortion Performance

Authors:

Atencia Javier Ruiz (1), López-Granado Otoniel (1), Pérez Malumbres Manuel (1), Martínez-Rach Miguel (1), Coll Damian Ruiz (2), Fernández Escribano Gerardo (3), Van Wallendael Glenn (4)

Affiliations:

1. Department of Computer Engineering, Miguel Hernández University, 03202 Elche, Spain

2. Department of Signal and Communications Theory, Rey Juan Carlos University, 28933 Madrid, Spain

3. School of Industrial Engineering, University of Castilla-La Mancha, 13001 Albacete, Spain

4. IDLab-MEDIA, Ghent University—IMEC, B-9052 Ghent, Belgium

Abstract

As most videos are ultimately viewed by humans, many techniques have been designed to improve video coding by exploiting how the human visual system perceives video quality. In this paper, we propose two perceptual coding techniques, contrast masking and texture masking, operating jointly under the High Efficiency Video Coding (HEVC) standard. These techniques aim to improve the subjective quality of the reconstructed video at the same bit rate. For contrast masking, we propose a dedicated weighting matrix for each block size (from 4×4 up to 32×32), unlike the HEVC standard, which defines only an 8×8 weighting matrix that is upscaled to build the 16×16 and 32×32 weighting matrices (a 4×4 weighting matrix is not supported). This approach achieves average Bjøntegaard Delta-Rate (BD-rate) gains of between 2.5% and 4.48%, depending on the perceptual metric and coding mode used. In addition, we propose a novel texture masking scheme that classifies each coding unit and applies an over-quantization that depends on its texture level. For each coding unit, mean directional variance features are computed and fed to a support vector machine model that predicts the texture type (plane, edge, or texture). Based on this classification, the block's energy, the type of coding unit, and its size, an over-quantization value is computed as a QP offset (DQP) to be applied to that coding unit. Applying both techniques in the HEVC reference software yields an overall average BD-rate gain of 5.79%, demonstrating that they are complementary.
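
Two mechanics mentioned in the abstract can be illustrated with a minimal sketch: building larger weighting matrices by upscaling an 8×8 one, and the effect of a QP offset on the quantization step. The sketch assumes the upscaling is plain sample replication and uses the commonly cited approximation that the quantization step doubles every 6 QP units. The function names, the 8×8 weights, and the offset value are hypothetical and are not taken from the paper.

```python
def upscale_by_replication(matrix, factor):
    """Upscale a square matrix by repeating each entry factor x factor times."""
    return [
        [value for value in row for _ in range(factor)]
        for row in matrix
        for _ in range(factor)
    ]


def quantization_step(qp):
    """Approximate quantization step size: doubles every 6 QP units."""
    return 2.0 ** ((qp - 4) / 6.0)


# Hypothetical 8x8 weights (illustrative only): larger at higher frequencies.
base_8x8 = [[16 + 2 * (r + c) for c in range(8)] for r in range(8)]

# Larger matrices derived from the 8x8 one, as opposed to dedicated
# per-size matrices such as those the abstract proposes.
m16 = upscale_by_replication(base_8x8, 2)
m32 = upscale_by_replication(base_8x8, 4)

# Effect of a hypothetical QP offset (DQP) on the quantization step.
base_qp = 32
dqp = 3
ratio = quantization_step(base_qp + dqp) / quantization_step(base_qp)
print(f"16x16: {len(m16)}x{len(m16[0])}, 32x32: {len(m32)}x{len(m32[0])}")
print(f"DQP={dqp} scales the quantization step by about {ratio:.3f}")
```

Under this approximation, an offset of 6 would double the quantization step, so small positive DQP values give a moderate, graded increase in quantization for coding units where distortion is expected to be less visible.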

Publisher

MDPI AG

