A Hybrid Contrast and Texture Masking Model to Boost High Efficiency Video Coding Perceptual Rate-Distortion Performance

Published: 2024-08-22 Issue: 16 Volume: 13 Page: 3341
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Atencia Javier Ruiz¹, López-Granado Otoniel¹, Pérez Malumbres Manuel¹, Martínez-Rach Miguel¹, Coll Damian Ruiz², Fernández Escribano Gerardo³, Van Wallendael Glenn⁴

Affiliation:

1. Department of Computer Engineering, Miguel Hernández University, 03202 Elche, Spain

2. Department of Signal and Communications Theory, Rey Juan Carlos University, 28933 Madrid, Spain

3. School of Industrial Engineering, University of Castilla-La Mancha, 13001 Albacete, Spain

4. IDLab-MEDIA, Ghent University—IMEC, B-9052 Ghent, Belgium

Abstract

Because most video is ultimately destined for human viewers, many techniques have been designed to improve video coding based on how the human visual system perceives video quality. In this paper, we propose the joint use of two perceptual coding techniques, contrast masking and texture masking, under the High Efficiency Video Coding (HEVC) standard. These techniques aim to improve the subjective quality of the reconstructed video at the same bit rate. For contrast masking, we propose a dedicated weighting matrix for each block size (from 4×4 up to 32×32). The HEVC standard, by comparison, defines only an 8×8 weighting matrix, which is upscaled to build the 16×16 and 32×32 weighting matrices (a 4×4 weighting matrix is not supported). Our approach achieves average Bjøntegaard Delta-Rate (BD-rate) gains of between 2.5% and 4.48%, depending on the perceptual metric and coding mode used. For texture masking, we propose a novel scheme that classifies each coding unit and applies an over-quantization that depends on its texture level. For each coding unit, the mean directional variance features are computed and fed to a support vector machine model that predicts the texture type (plane, edge, or texture). Based on this classification, the block's energy, the type of coding unit, and its size, an over-quantization value is computed as a QP offset (DQP) to be applied to the coding unit. Applying both techniques in the HEVC reference software yields an overall average BD-rate gain of 5.79%, which demonstrates that they are complementary.
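The upscaling step mentioned in the abstract can be illustrated with a minimal sketch. It assumes the larger matrices are built by simple replication, so that each 8×8 entry is repeated over a 2×2 or 4×4 area. The function name `upscale_by_replication` and the values in `base8` are hypothetical placeholders; they are not the HEVC default weights, nor the matrices proposed in the paper.

```python
def upscale_by_replication(base, factor):
    """Return a matrix in which every entry of `base` is repeated
    over a factor x factor area (nearest-neighbour upscaling)."""
    rows = len(base) * factor
    cols = len(base[0]) * factor
    return [
        [base[r // factor][c // factor] for c in range(cols)]
        for r in range(rows)
    ]

# Placeholder 8x8 weighting matrix (illustrative values only):
# weights grow with the frequency index, as coarser quantization
# is usually tolerated at higher frequencies.
base8 = [[16 + r + c for c in range(8)] for r in range(8)]

m16 = upscale_by_replication(base8, 2)  # 16x16 matrix
m32 = upscale_by_replication(base8, 4)  # 32x32 matrix
```

Under this assumption the 16×16 and 32×32 matrices carry no more frequency detail than the 8×8 one, which is the limitation that a dedicated matrix per block size is meant to address.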

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/16/3341/pdf

References (43 articles)

1. Image quality assessment and human visual system;Gao;Proceedings of the Visual Communications and Image Processing 2010,2010

2. The effects of a visual fidelity criterion on the encoding of images;Mannos;IEEE Trans. Inf. Theory,1974

3. A visual model weighted cosine transform for image compression and quality assessment;Nill;IEEE Trans. Commun.,1985

4. Subroutine for the Generation of a Two Dimensional Human Visual Contrast Sensitivity Function;Daly;Eastman Kodak, Technical Report Y, 233203,1987

5. Adaptive cosine transform coding of images in perceptual domain;Ngan;IEEE Trans. Acoust. Speech Signal Process.,1989