Semantic Scene Completion with Multi-Feature Data Balancing Network

编辑：广东客 | 分类：CV | 2025年4月24日

Note: We don't have the ability to review paper

PubDate: Dec 2024

Teams:1University of Southampton,Imam Mohammad Ibn Saud Islamic University

Writers:Mona Alawadh, Mahesan Niranjan, Hansung Kim

PDF:Semantic Scene Completion with Multi-Feature Data Balancing Network

Abstract

Semantic Scene Completion (SSC) is a critical task in computer vision, that utilized in applications such as virtual reality (VR). SSC aims to construct detailed 3D models from partial views by transforming a single 2D image into a 3D representation, assigning each voxel a semantic label. The main challenge lies in completing 3D volumes with limited information, compounded by data imbalance, inter-class ambiguity, and intra-class diversity in indoor scenes. To address this, we propose the Multi-Feature Data Balancing Network (MDBNet), a dual-head model for RGB and depth data (F-TSDF) inputs. Our hybrid encoder-decoder architecture with identity transformation in a pre-activation residual module (ITRM) effectively manages diverse signals within F-TSDF. We evaluate RGB feature fusion strategies and use a combined loss function cross entropy for 2D RGB features and weighted cross-entropy for 3D SSC predictions. MDBNet results surpass comparable state-of-the-art (SOTA) methods on NYU datasets, demonstrating the effectiveness of our approach.

本文链接：https://paper.nweon.com/16301

Semantic Scene Completion with Multi-Feature Data Balancing Network

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Semantic Scene Completion with Multi-Feature Data Balancing Network

您可能还喜欢...

Smooth head tracking for virtual reality applications

Semi-Transparent Image Sensors for Eye-Tracking Applications

Real-time depth-of-field rendering for shadow

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘