Implicit-explicit Integrated Representations for Multi-view Video Compression

Editor: 广东客 | Category: CV | December 19, 2023

Note: We do not have the ability to review papers.

PubDate: Dec 2023

Team: Shanghai Jiao Tong University

Authors: Chen Zhu, Guo Lu, Bing He, Rong Xie, Li Song

PDF: Implicit-explicit Integrated Representations for Multi-view Video Compression

Abstract

With the increasing consumption of 3D displays and virtual reality, multi-view video has become a promising format. However, its high resolution and multi-camera shooting result in a substantial increase in data volume, making storage and transmission a challenging task. To tackle these difficulties, we propose an implicit-explicit integrated representation for multi-view video compression. Specifically, we first use the explicit representation-based 2D video codec to encode one of the source views. Subsequently, we propose employing the implicit neural representation (INR)-based codec to encode the remaining views. The implicit codec takes the time and view index of multi-view video as coordinate inputs and generates the corresponding implicit reconstruction. To enhance the compressibility, we introduce a multi-level feature grid embedding and a fully convolutional architecture into the implicit codec. These components facilitate coordinate-feature and feature-RGB mapping, respectively. To further enhance the reconstruction quality from the INR codec, we leverage the high-quality reconstructed frames from the explicit codec to achieve inter-view compensation. Finally, the compensated results are fused with the implicit reconstructions from the INR to obtain the final reconstructed frames. Our proposed framework combines the strengths of both implicit neural representation and explicit 2D codec. Extensive experiments conducted on public datasets demonstrate that the proposed framework can achieve comparable or even superior performance to the latest multi-view video compression standard MIV and other INR-based schemes in terms of view compression and scene modeling.
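
To make the coordinate-feature mapping and the final fusion step more concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not give grid sizes, the interpolation scheme, or the fusion rule, so the bilinear lookup, the per-pixel blend weight, and every name below (make_grid, sample_level, embed, fuse) are assumptions chosen only for illustration. The fully convolutional feature-RGB decoder and the inter-view compensation itself are omitted.

```python
import random

def make_grid(t_res, v_res, channels, rng):
    """Hypothetical learnable grid: t_res x v_res cells of `channels` floats."""
    return [[[rng.uniform(-1.0, 1.0) for _ in range(channels)]
             for _ in range(v_res)] for _ in range(t_res)]

def lerp_index(x, res):
    """Map x in [0, 1] to two neighbouring grid indices and a blend weight."""
    pos = x * (res - 1)
    lo = min(int(pos), res - 1)
    hi = min(lo + 1, res - 1)
    return lo, hi, pos - lo

def sample_level(grid, t, v):
    """Bilinearly interpolate one grid level at normalized (time, view)."""
    t0, t1, wt = lerp_index(t, len(grid))
    v0, v1, wv = lerp_index(v, len(grid[0]))
    out = []
    for c in range(len(grid[0][0])):
        top = (1 - wv) * grid[t0][v0][c] + wv * grid[t0][v1][c]
        bot = (1 - wv) * grid[t1][v0][c] + wv * grid[t1][v1][c]
        out.append((1 - wt) * top + wt * bot)
    return out

def embed(grids, t, v):
    """Concatenate the features of every level into one embedding vector."""
    feat = []
    for grid in grids:
        feat.extend(sample_level(grid, t, v))
    return feat

def fuse(implicit_px, compensated_px, weight):
    """Blend an INR pixel with an inter-view compensated pixel (assumed rule)."""
    return [weight * c + (1 - weight) * i
            for i, c in zip(implicit_px, compensated_px)]

rng = random.Random(0)
# Three levels, coarse to fine, 4 feature channels each (arbitrary sizes).
grids = [make_grid(r, r, 4, rng) for r in (2, 4, 8)]
embedding = embed(grids, t=0.5, v=0.25)
```

In this sketch the time and view indices are first normalized to [0, 1]; a coarse level captures slow changes across frames and views, while finer levels add detail. In a real codec the grids and the decoder would be trained per scene and then quantized and entropy coded.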

Article link: https://paper.nweon.com/15047
