MonoHuman: Animatable Human Neural Field from Monocular Video

Editor: 映维 | Category: CV / XR | April 11, 2023

Note: We do not have the ability to review papers.

PubDate: Apr 2023

Teams: SenseTime Research; Shanghai AI Laboratory; The Chinese University of Hong Kong

Authors: Zhengming Yu, Wei Cheng, Xian Liu, Wayne Wu, Kwan-Yee Lin

PDF: MonoHuman: Animatable Human Neural Field from Monocular Video

Abstract

Animating virtual avatars with free-view control is crucial for various applications like virtual reality and digital entertainment. Previous studies have attempted to utilize the representation power of the neural radiance field (NeRF) to reconstruct the human body from monocular videos. Recent works propose to graft a deformation network into the NeRF to further model the dynamics of the human neural field for animating vivid human motions. However, such pipelines either rely on pose-dependent representations or fall short of motion coherency due to frame-independent optimization, making it difficult to generalize to unseen pose sequences realistically. In this paper, we propose a novel framework MonoHuman, which robustly renders view-consistent and high-fidelity avatars under arbitrary novel poses. Our key insight is to model the deformation field with bi-directional constraints and explicitly leverage the off-the-peg keyframe information to reason the feature correlations for coherent results. Specifically, we first propose a Shared Bidirectional Deformation module, which creates a pose-independent generalizable deformation field by disentangling backward and forward deformation correspondences into shared skeletal motion weight and separate non-rigid motions. Then, we devise a Forward Correspondence Search module, which queries the correspondence feature of keyframes to guide the rendering network. The rendered results are thus multi-view consistent with high fidelity, even under challenging novel pose settings. Extensive experiments demonstrate the superiority of our proposed MonoHuman over state-of-the-art methods.
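
The abstract describes a deformation field in which forward and backward correspondences share one set of skeletal motion weights and are tied together by bi-directional constraints. The toy sketch below illustrates that idea only; it is not the paper's implementation. Everything in it is an assumption made for illustration: the 2D setting, the Gaussian weight function `canonical_weights`, the bone `centers`, the helper names, and the omission of the non-rigid motion terms. It blends rigid bone transforms with weights defined in canonical space, reuses those same weights for the backward direction, and measures how far a canonical point lands from itself after a forward-then-backward round trip.

```python
import math

def apply(T, p):
    """Apply a 2D rigid transform T = (cos, sin, tx, ty) to point p."""
    c, s, tx, ty = T
    return (c * p[0] - s * p[1] + tx, s * p[0] + c * p[1] + ty)

def invert(T):
    """Inverse of x' = R x + t, i.e. x = R^T x' - R^T t."""
    c, s, tx, ty = T
    return (c, -s, -(c * tx + s * ty), s * tx - c * ty)

def canonical_weights(p, centers, sigma=0.5):
    """Hypothetical shared weights: normalized Gaussians around bone centers,
    evaluated in canonical space."""
    d = [(p[0] - cx) ** 2 + (p[1] - cy) ** 2 for cx, cy in centers]
    m = min(d)  # subtract the smallest distance so the largest term is exp(0)
    raw = [math.exp(-(di - m) / (2 * sigma ** 2)) for di in d]
    total = sum(raw)
    return [r / total for r in raw]

def blend(weights, points):
    """Weighted average of 2D points."""
    return (sum(w * q[0] for w, q in zip(weights, points)),
            sum(w * q[1] for w, q in zip(weights, points)))

def forward_deform(p_canon, bones, centers):
    """Canonical -> observation: blend each bone's transform of the point."""
    w = canonical_weights(p_canon, centers)
    return blend(w, [apply(T, p_canon) for T in bones])

def backward_deform(p_obs, bones, centers):
    """Observation -> canonical, reusing the canonical-space weights: bone k's
    weight is read at the point mapped back by bone k's inverse transform."""
    candidates = [apply(invert(T), p_obs) for T in bones]
    raw = [canonical_weights(c, centers)[k] for k, c in enumerate(candidates)]
    total = sum(raw)
    if total == 0.0:  # fall back to uniform weights if every term underflows
        raw, total = [1.0] * len(raw), float(len(raw))
    return blend([r / total for r in raw], candidates)

def consistency_error(p_canon, bones, centers):
    """Distance between a canonical point and its forward-then-backward image."""
    p_back = backward_deform(forward_deform(p_canon, bones, centers), bones, centers)
    return math.hypot(p_back[0] - p_canon[0], p_back[1] - p_canon[1])

centers = [(0.0, 0.0), (1.0, 0.0)]
bent = [(1.0, 0.0, 0.0, 0.0), (math.cos(0.6), math.sin(0.6), 0.1, 0.0)]
print(consistency_error((0.5, 0.1), bent, centers))
```

When all bones share the same transform the round trip is exact up to floating-point rounding. When the bones differ, the blended forward and backward maps are no longer exact inverses, and the residual is the kind of quantity a bi-directional consistency term could penalize during training.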

Permalink: https://paper.nweon.com/14270
