Improved RGBD semantic segmentation using multi-scale features

小编映维 | 分类：CV / XR | 2020年10月26日

Note: We don't have the ability to review paper

PubDate: July 2018

Teams: Shanghai Jiao Tong University；Luoyang Institute of Electro-optical Equipment

Writers: Xiaoning Gao; Meng Cai; Jianxun Li

PDF: Improved RGBD semantic segmentation using multi-scale features

Abstract

RGBD semantic segmentation is a popular task in computer vision with applications in autonomous vehicles and virtual reality. This problem is challenging due to the cluttered, dense and diverse scenes. To solve the loss of context information in dense semantic scene segmentation, we propose a novel architecture built on multi-scale feature representation that contains more global and local context cues. The multi-scale features, which are generated via aggregating 3D region features and sparse coding SIFT features extracted from multiresolution RGB and depth images, are fed into a softmax classifier to labeling each region produced by hierarchical segmentation with a predefined class, that is our final semantic scene segmentation. In addition, compared to the rough four categories predefined from the 894 pixel categories in NYUD2 dataset, we define the 40 detailed pixel classes that cover most common object categories and makes a fine-grained semantic segmentation. Extensive experiments on the standard NYUD2 benchmark demonstrate the effectiveness of our method.

本文链接：https://paper.nweon.com/7551

Improved RGBD semantic segmentation using multi-scale features

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Improved RGBD semantic segmentation using multi-scale features

您可能还喜欢...

sur.faced.io: augmented reality content creation for your face and beyond by drawing on paper

Optical FIow, Perturbation Velocities and Postural Response In Virtual Reality

iHuman3D: Intelligent Human Body 3D Reconstruction using a Single Flying Camera

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘