ArK: Augmented Reality with Knowledge Interactive Emergent Ability

编辑：映维 | 分类：XR | 2023年5月10日

Note: We don't have the ability to review paper

PubDate: Apr 2023

Teams: Microsoft Research, Redmond † MILA §University of Washington \ UCLA

Writers: Qiuyuan Huang, Jae Sung Park, Abhinav Gupta, Paul Bennett, Ran Gong, Subhojit Som, Baolin Peng, Owais Khan Mohammed, Chris Pal, Yejin Choi, Jianfeng Gao

PDF: ArK: Augmented Reality with Knowledge Interactive Emergent Ability

ArK: Augmented Reality with Knowledge Interactive Emergent Ability

Abstract

Despite the growing adoption of mixed reality and interactive AI agents, it remains challenging for these systems to generate high quality 2D/3D scenes in unseen environments. The common practice requires deploying an AI agent to collect large amounts of data for model training for every new task. This process is costly, or even impossible, for many domains. In this study, we develop an infinite agent that learns to transfer knowledge memory from general foundation models (e.g. GPT4, DALLE) to novel domains or scenarios for scene understanding and generation in the physical or virtual world. The heart of our approach is an emerging mechanism, dubbed Augmented Reality with Knowledge Inference Interaction (ArK), which leverages knowledge-memory to generate scenes in unseen physical world and virtual reality environments. The knowledge interactive emergent ability (Figure 1) is demonstrated as the observation learns i) micro-action of cross-modality: in multi-modality models to collect a large amount of relevant knowledge memory data for each interaction task (e.g., unseen scene understanding) from the physical reality; and ii) macro-behavior of reality-agnostic: in mix-reality environments to improve interactions that tailor to different characterized roles, target variables, collaborative information, and so on. We validate the effectiveness of ArK on the scene generation and editing tasks. We show that our ArK approach, combined with large foundation models, significantly improves the quality of generated 2D/3D scenes, compared to baselines, demonstrating the potential benefit of incorporating ArK in generative AI for applications such as metaverse and gaming simulation.

本文链接：https://paper.nweon.com/14363

ArK: Augmented Reality with Knowledge Interactive Emergent Ability

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

ArK: Augmented Reality with Knowledge Interactive Emergent Ability

您可能还喜欢...

Binaural signal matching with arbitrary array based on a sound field model

Ambient Intelligence for Next-Generation AR

Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘