Fast Mode Decision Method of Multiple Weighted Bi-Predictions Using Lightweight Multilayer Perceptron in Versatile Video Coding

Author:

Lee Taesik1,Jun Dongsan1

Affiliation:

1. Department of Computer Engineering, Dong-A University, Busan 49315, Republic of Korea

Abstract

Versatile Video Coding (VVC), the state-of-the-art video coding standard, was developed by the Joint Video Experts Team (JVET) of ISO/IEC Moving Picture Experts Group (MPEG) and ITU-T Video Coding Experts Group (VCEG) in 2020. Although VVC can provide powerful coding performance, it requires tremendous computational complexity to determine the optimal mode decision during the encoding process. In particular, VVC adopted the bi-prediction with CU-level weight (BCW) as one of the new tools, which enhanced the coding efficiency of conventional bi-prediction by assigning different weights to the two prediction blocks in the process of inter prediction. In this study, we investigate the statistical characteristics of input features that exhibit a correlation with the BCW and define four useful types of categories to facilitate the inter prediction of VVC. With the investigated input features, a lightweight neural network with multilayer perceptron (MLP) architecture is designed to provide high accuracy and low complexity. We propose a fast BCW mode decision method with a lightweight MLP to reduce the computational complexity of the weighted multiple bi-prediction in the VVC encoder. The experimental results show that the proposed method significantly reduced the BCW encoding complexity by up to 33% with unnoticeable coding loss, compared to the VVC test model (VTM) under the random-access (RA) configuration.

Funder

Dong-A University research fund

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Reference30 articles.

1. Overview of the versatile video coding (VVC) standard and its applications;Bross;IEEE Trans. Circuits Syst. Video Technol.,2021

2. (2023, April 17). Versatile Video Coding Test Model (VTM) Reference Software of the JVET of ITU-T VCEG and ISO/IEC MPEG. Available online: https://vcgit.hhi.fraungoefer.de/jvet/VVCSoftware_VTM.

3. Overview of the high efficiency video coding (HEVC) standard;Sullivan;IEEE Trans. Circuits Syst. Video Technol.,2012

4. Bossen, F., Boyce, J., Suhring, K., Li, X., and Seregin, V. (2020, January 7–16). VTM common test conditions and software reference configurations for SDR video. Joint Video Experts Team (JVET) of ITU-T ISO/IEC, Document JVET-T2010. Proceedings of the 20th Meeting, Teleconference.

5. He, Y., Luo, J., Xiu, X., and Ye, Y. (2018, January 3–12). CE4-related: Generalized bi-prediction improvements combined from JVET-L0197 and JVET-L0296. Joint Video Experts Team (JVET) of ITU-T ISO/IEC, Document JVET-L0646. Proceedings of the 12th Meeting, Macao, China.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3