Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction

编辑：广东客 | 分类：XR | 2024年8月15日

Note: We don't have the ability to review paper

PubDate: March 2024

Teams: West Virginia University, Morgantown, WV, 26506, USA2Intel Corporation, Santa Clara, CA, USA3University of Southern California, Los Angeles, CA, 90089, USA4Coupa Software, San Mateo

Writers: Xue Bai1, Tasmiah Haque1, Sumit Mohan2, Yuliang Cai3, Byungheon Jeong4, Ad´am Hal´asz ´ 1,and Srinjoy Das

PDF: Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction

Abstract

We propose a deep learning based novel prediction framework for enhanced bandwidth reduction in motion transfer enabled video applications such as video conferencing, virtual reality gaming and privacy preservation for patient health monitoring. To model complex motion, we use the First Order Motion Model (FOMM) that represents dynamic objects using learned keypoints along with their local affine transformations. Keypoints are extracted by a self-supervised keypoint detector and organized in a time series corresponding to the video frames. Prediction of keypoints, to enable transmission using lower frames per second on the source device, is performed using a Variational Recurrent Neural Network (VRNN). The predicted keypoints are then synthesized to video frames using an optical flow estimator and a generator network. This efficacy of leveraging keypoint based representations in conjunction with VRNN based prediction for both video animation and reconstruction is demonstrated on three diverse datasets. For real-time applications, our results show the effectiveness of our proposed architecture by enabling up to 2x additional bandwidth reduction over existing keypoint based video motion transfer frameworks without significantly compromising video quality

本文链接：https://paper.nweon.com/16015

Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction

您可能还喜欢...

User interface considerations to prevent self-driving carsickness

Continuous Face Aging via Self-estimated Residual Age Embedding

Human Intention Estimation based on Hidden Markov Model Motion Validation for Safe Flexible Robotized Warehouses

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘