Learning Spatio-Temporal Downsampling for Effective Video Upscaling

Editor: Nweon (映维) | Category: CV / XR | January 31, 2023

Note: We are not able to review papers.

PubDate: Oct 2022

Teams: Meta; University of Texas at Dallas

Authors: Xiaoyu Xiang, Yapeng Tian, Vijay Rengarajan, Lucas D. Young, Bo Zhu, Rakesh Ranjan

PDF: Learning Spatio-Temporal Downsampling for Effective Video Upscaling

Abstract

Downsampling is one of the most basic image processing operations. Improper spatio-temporal downsampling applied to videos can cause aliasing issues such as moiré patterns in space and the wagon-wheel effect in time. Consequently, the inverse task of upscaling a low-resolution, low frame-rate video in space and time becomes a challenging ill-posed problem due to information loss and aliasing artifacts. In this paper, we aim to solve the space-time aliasing problem by learning a spatio-temporal downsampler. Towards this goal, we propose a neural network framework that jointly learns spatio-temporal downsampling and upsampling. It enables the downsampler to retain the key patterns of the original video and maximizes the reconstruction performance of the upsampler. To make the downsampling results compatible with popular image and video storage formats, the downsampling results are encoded to uint8 with a differentiable quantization layer. To fully utilize the space-time correspondences, we propose two novel modules for explicit temporal propagation and space-time feature rearrangement. Experimental results show that our proposed method significantly boosts the space-time reconstruction quality by preserving spatial textures and motion patterns in both downsampling and upscaling. Moreover, our framework enables a variety of applications, including arbitrary video resampling, blurry frame reconstruction, and efficient video storage.
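The wagon-wheel effect mentioned in the abstract can be illustrated with a small, hedged sketch. It is not taken from the paper: the function name apparent_step_deg, the 50-degree rotation rate, and the frame-skipping factors are all hypothetical, chosen only to show how naive temporal downsampling (keeping every k-th frame with no filtering) can make a rotating pattern appear to slow down or run backward.

```python
def apparent_step_deg(true_step_deg: float, keep_every: int) -> float:
    """Rotation between two kept frames as a viewer would perceive it.

    Naive temporal downsampling keeps one frame out of `keep_every`.
    A viewer cannot tell full turns apart, so the perceived step is the
    true step wrapped into the range [-180, 180) degrees.
    """
    step = true_step_deg * keep_every
    return (step + 180.0) % 360.0 - 180.0


# Hypothetical wheel that turns 50 degrees forward on every original frame.
for keep_every in (1, 2, 4, 7):
    perceived = apparent_step_deg(50.0, keep_every)
    print(f"keep every {keep_every} frame(s): perceived step {perceived:+.1f} deg")
```

Under these assumptions, keeping every 4th or 7th frame yields a negative perceived step, so the wheel seems to rotate backward. This is the kind of temporal aliasing a learned downsampler would aim to avoid.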

Article link: https://paper.nweon.com/14026
