Multitask Learning Based Intra-Mode Decision Framework for Versatile Video Coding

Author:

Zouidi NaimaORCID,Kessentini Amina,Hamidouche Wassim,Masmoudi Nouri,Menard Daniel

Abstract

In mid-2020, the new international video coding standard, namely versatile video coding (VVC), was officially released by the Joint Video Expert Team (JVET). As its name indicates, the VVC enables a higher level of versatility with better compression performance compared to its predecessor, high-efficiency video coding (HEVC). VVC introduces several new coding tools like multiple reference lines (MRL) and matrix-weighted intra-prediction (MIP), along with several improvements on the block-based hybrid video coding scheme such as quatree with nested multi-type tree (QTMT) and finer-granularity intra-prediction modes (IPMs). Because finding the best encoding decisions is usually preceded by optimizing the rate distortion (RD) cost, introducing new coding tools or enhancing existing ones requires additional computations. In fact, the VVC is 31 times more complex than the HEVC. Therefore, this paper aims to reduce the computational complexity of the VVC. It establishes a large database for intra-prediction and proposes a multitask learning (MTL)-based intra-mode decision framework. Experimental results show that our proposal enables up to 30% of complexity reduction while slightly increasing the Bjontegaard bit rate (BD-BR).

Funder

PHC Maghreb

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Reference33 articles.

1. cisco (2020, September 01). Cisco Visual Networking Index: Forecast and Trends 2017–2022. Available online: https://cloud.report/whitepapers/.

2. JVET (2021, March 01). AHG Report: Test Model Software Development (AHG3). Available online: https://jvet-experts.org/.

3. Overview of the Versatile Video Coding (VVC) Standard and its Applications;Bross;IEEE Trans. Circuits Syst. Video Technol.,2021

4. An improved framework of affine motion compensation in video coding;Zhang;IEEE Trans. Image Process.,2018

5. JVET (2020, January 17). Algorithm description for Versatile Video Coding and Test Model 8. Proceedings of the 17th Meeting ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, Brussels, Belgium.

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Fast rough mode decision algorithm and hardware architecture design for AV1 encoder;Journal of Real-Time Image Processing;2024-09-12

2. VVC Intra Coding Complexity Optimization Based on Early Skipping of the Secondary Transform;IEEE Signal Processing Letters;2024

3. The Complexity Optimization of Dependent Quantization in VVC By Delaying Starting Point;2023 IEEE International Conference on Visual Communications and Image Processing (VCIP);2023-12-04

4. Visual Perception Based Intra Coding Algorithm for H.266/VVC;Electronics;2023-05-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3