Transcoding V-PCC Point Cloud Streams in Real-time

Published: 2024-08
ISSN: 1551-6857
Container-title: ACM Transactions on Multimedia Computing, Communications, and Applications
Language: en
Short-container-title: ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Rudolph Michael¹, Schneegass Stefan², Rizk Amr³

Affiliation:

1. University of Duisburg-Essen, Germany and Leibniz University Hannover, Germany

2. University of Duisburg-Essen, Germany

3. Leibniz University Hannover, Germany

Abstract

Dynamic point clouds are a representation for 3D immersive media that allows users to freely navigate a scene while consuming the content. However, this comes at the cost of substantial data size, requiring efficient compression techniques to make point cloud videos accessible. Addressing this, Video-based Point Cloud Compression (V-PCC) projects points into 2D patches to compress video frames, leveraging the high compression efficiency of legacy video codecs and exploiting temporal correlations in the 2D images. However, clustering and projecting points into meaningful 2D patches is computationally intensive, leading to high encoding latency in V-PCC. Applying adaptive streaming techniques, originating from traditional video streaming, multiplies the computational effort, as multiple encodings of the same content are required. In this light, transcoding a compressed representation into lower qualities for dynamic adaptation to user requirements is gaining popularity. To address the high latency of employing the full decoder-encoder stack of V-PCC during transcoding, we propose RABBIT, a novel technique that only re-encodes the underlying video sub-streams. This is in contrast to slow V-PCC transcoding, which reconstructs and re-encodes the raw point cloud at a new quality setting. By eliminating the expensive overhead of calculations based on the 3D space representation, the latency of RABBIT is bounded by the latency of transcoding the underlying video streams, allowing optimized video codec implementations to be used to meet the real-time requirements of adaptive streaming systems. Our evaluations of RABBIT, using various optimized video codec implementations, show on-par quality with the baseline V-PCC transcoding given a high-quality representation. Given unicast or multicast distribution of a point cloud stream and in-network or edge transcoders, our evaluations show the tradeoff between rate-distortion performance and the required network bandwidth.
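
The core idea in the abstract, re-encoding only the video sub-streams while leaving everything else in the compressed representation untouched, can be illustrated with a minimal sketch. This is not the authors' implementation: the `VpccSegment` container, the `transcode_video_only` function, the sub-stream names, and the placeholder `halve_payload` transcoder are all hypothetical, and a real V-PCC bitstream is structured as specified in ISO/IEC 23090-5 rather than as a plain dictionary of byte strings.

```python
from dataclasses import dataclass, replace
from typing import Callable, Dict

# Hypothetical stand-in for one segment of a compressed V-PCC stream.
@dataclass(frozen=True)
class VpccSegment:
    atlas: bytes                        # patch metadata, passed through unchanged
    video_substreams: Dict[str, bytes]  # assumed names, e.g. "geometry", "attribute"

# A video transcoder maps (sub-stream name, encoded bytes) to re-encoded bytes.
VideoTranscoder = Callable[[str, bytes], bytes]

def transcode_video_only(segment: VpccSegment,
                         transcode: VideoTranscoder) -> VpccSegment:
    """Re-encode each video sub-stream; never reconstruct the 3D point cloud."""
    new_video = {name: transcode(name, data)
                 for name, data in segment.video_substreams.items()}
    return replace(segment, video_substreams=new_video)

def halve_payload(name: str, data: bytes) -> bytes:
    """Placeholder for a real video transcoder: just shortens the payload."""
    return data[: max(1, len(data) // 2)]

if __name__ == "__main__":
    seg = VpccSegment(atlas=b"atlas",
                      video_substreams={"geometry": b"gggg", "attribute": b"aaaaaa"})
    out = transcode_video_only(seg, halve_payload)
    print(out.atlas, out.video_substreams)
```

Because the per-segment work in this sketch is only the calls to the video transcoder, its latency is determined by that transcoder, which mirrors the bound the abstract states for RABBIT.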

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3682062

References (51 articles)

1. ISO/IEC JTC 1/SC 29. 2021. ISO/IEC 23090-5:2021 Information technology — Coded representation of immersive media — Part 5: Visual volumetric video-based coding (V3C) and video-based point cloud compression (V-PCC). ISO/IEC.

2. ISO/IEC JTC 1/SC 29. 2022. ISO/IEC 23009-1:2022 Information technology — Dynamic adaptive streaming over HTTP (DASH) — Part 1: Media presentation description and segment formats. ISO/IEC.

3. FastTTPS: fast approach for video transcoding time prediction and scheduling for HTTP adaptive streaming videos

4. RATS

5. Divyashri Bhat, Amr Rizk, Michael Zink, and Ralf Steinmetz. 2017. Network Assisted Content Distribution for Adaptive Bitrate Video Streaming. In ACM on Multimedia Systems Conference (MMSys). 62–75.