Reasonable Perception: Connecting Vision and Language Systems for Validating Scene Descriptions

小编映维 | 分类：XR | 2020年6月23日

Note: We don't have the ability to review paper

PubDate: March 2018

Teams: Massachusetts Institute of Technology

Writers: Leilani H. Gilpin;Cagri Zaman;Danielle Olson;Ben Z. Yuan

PDF: Reasonable Perception: Connecting Vision and Language Systems for Validating Scene Descriptions

Abstract

Understanding explanations of machine perception is an important step towards developing accountable, trustworthy machines. Furthermore, speech and vision are the primary modalities by which humans collect information about the world, but the linking of visual and natural language domains is a relatively new pursuit in computer vision, and it is difficult to test performance in a safe environment. To couple human visual understanding and machine perception, we present an explanatory system for creating a library of possible context-specific actions associated with 3D objects in immersive virtual worlds. We also contribute a novel scene description dataset, generated natively in virtual reality containing speech, image, gaze, and acceleration data. We discuss the development of a hybrid machine learning algorithm linking vision data with environmental affordances in natural language. Our findings demonstrate that it is possible to develop a model which can generate interpretable verbal descriptions of possible actions associated with recognized 3D objects within immersive VR environments.

本文链接：https://paper.nweon.com/2883

Reasonable Perception: Connecting Vision and Language Systems for Validating Scene Descriptions

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Reasonable Perception: Connecting Vision and Language Systems for Validating Scene Descriptions

您可能还喜欢...

Gaze-Vergence-Controlled See-Through Vision in Augmented Reality

The effect of gaming on accommodative and vergence facilities after exposure to virtual reality head-mounted display

Liquid Crystal Based 5 cm Adaptive Focus Lens to Solve Accommodation-Convergence (AC) Mismatch Issue of AR/VR/3D Displays

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘