Machine Learning Based Fast QTMTT Partitioning Strategy for VVenC Encoder in Intra Coding

Author:

Taabane Ibrahim (1, 2), Menard Daniel (1), Mansouri Anass (3), Ahaitouf Ali (2)

Affiliation:

1. IETR, UMR CNRS 6164, Electronics and Computer Engineering Department, INSA Rennes, University of Rennes, 35000 Rennes, France

2. Laboratory of Intelligent Systems, Geo-Resources and Renewable Energies, Faculty of Sciences and Technologies, Sidi Mohamed Ben Abdellah University, Fez 30000, Morocco

3. Laboratory of Intelligent Systems, Geo-Resources and Renewable Energies, National School of Applied Sciences, Sidi Mohamed Ben Abdellah University, Fez 30000, Morocco

Abstract

The newest video compression standard, Versatile Video Coding (VVC), was finalized in July 2020 by the Joint Video Experts Team (JVET). Its main goal is to reduce the bitrate by 50% relative to its predecessor, High Efficiency Video Coding (HEVC). VVC achieves its high coding performance through new advanced tools and features, for instance the Quad Tree with nested Multi-Type Tree (QTMTT) structure used for block partitioning. These techniques deliver superior performance compared to HEVC, but at the cost of increased computational complexity. To tackle this complexity, this work proposes a fast Coding Unit (CU) partitioning algorithm based on machine learning for the intra configuration of VVC. The proposed algorithm consists of five binary Light Gradient Boosting Machine (LightGBM) classifiers, which directly predict the most probable split mode for each coding unit without going through the exhaustive Rate Distortion Optimization (RDO) process. The LightGBM classifiers were trained offline on a large dataset and then embedded in VVenC, an optimized implementation of a VVC encoder. Experimental results show that the proposed approach offers a good trade-off between time saving and coding efficiency. Depending on the preset chosen, it achieves an average time saving of 30.21% to 82.46% compared to the VVenC encoder anchor, with a Bjøntegaard Delta Bitrate (BDBR) increase of 0.67% to 3.01%, respectively.
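To make the idea of replacing the RDO search with five binary decisions more concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not say which decision each classifier makes, so the cascade below (split or not, QT or MTT, horizontal or vertical, then binary or ternary per direction) is an assumption, as are the names `predict_split_mode`, `constant`, the dictionary keys, and the 0.5 threshold. Plain Python callables stand in for trained LightGBM models so that the example runs without any dependency.

```python
from typing import Callable, Dict, Sequence

Features = Sequence[float]
# Each classifier maps a CU feature vector to P(positive class).
BinaryClassifier = Callable[[Features], float]


def constant(p: float) -> BinaryClassifier:
    """Hypothetical stand-in for a trained model: always returns p."""
    return lambda features: p


def predict_split_mode(
    features: Features,
    clf: Dict[str, BinaryClassifier],
    threshold: float = 0.5,
) -> str:
    """Pick one of six split modes using at most four of five classifiers.

    Assumed cascade (not taken from the paper):
      "split"      : split vs. no split
      "qt"         : quad-tree vs. multi-type tree
      "horizontal" : horizontal vs. vertical MTT split
      "tt_h"       : ternary vs. binary, horizontal direction
      "tt_v"       : ternary vs. binary, vertical direction
    """
    if clf["split"](features) < threshold:
        return "NO_SPLIT"
    if clf["qt"](features) >= threshold:
        return "QT"
    if clf["horizontal"](features) >= threshold:
        return "TT_H" if clf["tt_h"](features) >= threshold else "BT_H"
    return "TT_V" if clf["tt_v"](features) >= threshold else "BT_V"


if __name__ == "__main__":
    # Stand-in models: split, not QT, horizontal, binary.
    models = {
        "split": constant(0.9),
        "qt": constant(0.2),
        "horizontal": constant(0.8),
        "tt_h": constant(0.3),
        "tt_v": constant(0.5),
    }
    print(predict_split_mode([0.0, 0.0], models))
```

With trained models, each stand-in could be replaced by a small wrapper around a LightGBM model's probability output; the encoder would then evaluate only the predicted mode instead of every candidate split.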

Funder

Campus France

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering


Cited by 3 articles.
