DATRA-MIV: Decoder-Adaptive Tiling and Rate Allocation for MPEG Immersive Video

Author:

Jeong Jong-Beom1ORCID,Lee Soonbin2ORCID,Ryu Eun-Seok3ORCID

Affiliation:

1. Sungkyunkwan University (SKKU),, Seoul, Republic of Korea

2. Fraunhofer Heinrich-Hertz-Institute (HHI), Berlin, Germany

3. Sungkyunkwan University (SKKU), Seoul, Republic of Korea

Abstract

The emerging immersive video coding standard moving picture experts group (MPEG) immersive video (MIV), which is ongoing standardization by MPEG-Immersive (MPEG-I) group, enables six degrees of freedom in a virtual reality environment that represents both natural and computer-generated scenes using multi-view video compression. The MIV eliminates the redundancy between multi-view videos and merges the residuals into multiple pictures, called an atlas. Thus, bitstreams with encoded atlases are generated and corresponding number of decoders are needed, which is challenging for the lightweight device with a single decoder. This article proposes a decoder-adaptive tiling and rate allocation method for MIV to overcome the challenge. First, the proposed method divides atlases into subpictures considering two aspects: (i) subpicture bitstream extracting and merging into one bitstream to use a single decoder and (ii) separation of each source view from the atlases for rate allocation. Second, the atlases are encoded by versatile video coding (VVC), using an extractable subpicture to divide the atlases into subpictures. Third, each subpicture bitstream is extracted, and asymmetric quality allocation for each subpictures is conducted by considering the residuals in the subpicture. Fourth, mixed-quality subpictures were merged by using the proposed bitstream merger. Fifth, the merged bitstream is decoded by using a single decoder. Finally, the viewing area of the user is synthesized by using the reconstructed atlases. Experimental results with the VVC test model (VTM) show that the proposed method achieves a 21.37% Bjøntegaard delta rate saving for immersive video peak signal-to-noise ratio and a 26.76% decoding runtime saving compared to the VTM anchor configuration. Moreover, it supports bitstreams for multiple decoders and single decoder without re-encoding, transcoding, or a substantial increase of the server-side storage.

Funder

Institute of Information & Communications Technology Planning & Evaluation

Publisher

Association for Computing Machinery (ACM)

Reference34 articles.

1. Comparing the Gear VR, Oculus Go, and Oculus Quest;Hillmann Cornel;Unreal for Mobile and Standalone VR,2019

2. Robert Skupin Yago Sanchez Karsten Sühring Thomas Schierl Eun-Seok Ryu and Jangwoo Son. 2018. Temporal MCTS coding constraints implementation. In Proceedings of the 122nd MPEG Meeting of ISO/IEC JTC1/SC29/WG11 (MPEG’18).

3. Implementing Motion-Constrained Tile and Viewport Extraction for VR Streaming

4. Overview of the High Efficiency Video Coding (HEVC) Standard

5. Overview of the Versatile Video Coding (VVC) Standard and its Applications

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Real-life Spatial Volumetric Video Acquisition and Encoding System;JOURNAL OF BROADCAST ENGINEERING;2024-07-31

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3