
GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR


PubDate: June 2024

Teams: Cruise LLC

Authors: Bharat Singh, Viveka Kulharia, Luyu Yang, Avinash Ravichandran, Ambrish Tyagi, Ashish Shrivastava

PDF: GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR

Abstract

Multimodal synthetic data generation is crucial in domains such as autonomous driving, robotics, augmented/virtual reality, and retail. We propose a novel approach, GenMM, for jointly editing RGB videos and LiDAR scans by inserting temporally and geometrically consistent 3D objects. Our method uses a reference image and 3D bounding boxes to seamlessly insert and blend new objects into target videos. We inpaint the 2D Regions of Interest (consistent with the 3D boxes) using a diffusion-based video inpainting model. We then compute the semantic boundaries of the object and estimate its surface depth using state-of-the-art semantic segmentation and monocular depth estimation techniques. Subsequently, we employ a geometry-based optimization algorithm to recover the 3D shape of the object's surface, ensuring it fits precisely within the 3D bounding box. Finally, LiDAR rays intersecting the new object's surface are updated to reflect depths consistent with its geometry. Our experiments demonstrate the effectiveness of GenMM in inserting various 3D objects across video and LiDAR modalities.
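To make the final step concrete, here is a minimal sketch of how LiDAR returns crossing the inserted object could be rewritten to match the recovered surface depth. This is not the authors' released code; the function name, argument layout, and pinhole-camera assumptions are ours. The idea follows the abstract: project each LiDAR point into the camera, keep points that fall inside the object's semantic mask and behind the recovered surface, and rescale their depth onto that surface.

```python
import numpy as np

def update_lidar_depths(points, K, T_lidar_to_cam, surface_depth, object_mask):
    """Rewrite LiDAR returns whose rays hit the inserted object's surface.

    points         : (N, 3) LiDAR points in the sensor frame.
    K              : (3, 3) camera intrinsics.
    T_lidar_to_cam : (4, 4) rigid transform, LiDAR frame -> camera frame.
    surface_depth  : (H, W) per-pixel depth of the recovered object surface.
    object_mask    : (H, W) boolean semantic mask of the inserted object.
    """
    H, W = surface_depth.shape

    # LiDAR points -> camera frame.
    pts_h = np.hstack([points, np.ones((len(points), 1))])
    pts_cam = (T_lidar_to_cam @ pts_h.T).T[:, :3]
    z = pts_cam[:, 2]

    # Pinhole projection into the image plane.
    uv = (K @ pts_cam.T).T
    z_safe = np.where(z > 0, z, 1.0)  # avoid divide-by-zero behind the camera
    u = np.round(uv[:, 0] / z_safe).astype(int)
    v = np.round(uv[:, 1] / z_safe).astype(int)
    valid = (z > 0) & (u >= 0) & (u < W) & (v >= 0) & (v < H)

    # Keep rays that land inside the object's mask and would otherwise
    # terminate behind the new surface (i.e. the object now occludes them).
    idx = np.flatnonzero(valid)
    idx = idx[object_mask[v[idx], u[idx]]]
    d_surf = surface_depth[v[idx], u[idx]]
    occluded = z[idx] > d_surf
    idx, d_surf = idx[occluded], d_surf[occluded]

    # Approximate the ray-surface intersection by rescaling each point along
    # its camera ray to the surface depth (exact only when the LiDAR and
    # camera centers coincide), then map it back to the LiDAR frame.
    pts_cam_new = pts_cam[idx] * (d_surf / z[idx])[:, None]
    pts_new_h = np.hstack([pts_cam_new, np.ones((len(idx), 1))])
    out = points.copy()
    out[idx] = (np.linalg.inv(T_lidar_to_cam) @ pts_new_h.T).T[:, :3]
    return out
```

The rescaling along the camera ray is a simplification: a full implementation would intersect each LiDAR ray with the optimized surface mesh, but for typical sensor baselines this approximation conveys the same occlusion-consistent update the abstract describes.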
