Situated and Interactive Multimodal Conversations

编辑：映维 | 分类：HCI / XR | 2020年12月29日

Note: We don't have the ability to review paper

PubDate: December 8, 2020

Teams: Facebook

Writers: Seungwhan Moon, Satwik Kottur, Paul A. Crook, Ankita De, Shivani Poddar, Theodore Levin, David Whitney, Daniel Difranco, Ahmad Beirami, Eunjoon Cho, Rajen Subba, Alborz Geramifard

PDF: Situated and Interactive Multimodal Conversations

Situated and Interactive Multimodal Conversations

Abstract

Next generation virtual assistants are envisioned to handle multimodal inputs (e.g., vision, memories of previous interactions, and the user’s utterances), and perform multimodal actions (e.g., displaying a route while generating the system’s utterance). We introduce Situated Interactive MultiModal Conversations (SIMMC) as a new direction aimed at training agents that take multimodal actions grounded in a co-evolving multimodal input context in addition to the dialog history. We provide two SIMMC datasets totalling ∼13K human-human dialogs (∼169K utterances) collected using a multimodal Wizard-of-Oz (WoZ) setup, on two shopping domains: (a) furniture – grounded in a shared virtual environment; and (b) fashion – grounded in an evolving set of images. Datasets include multimodal context of the items appearing in each scene, and contextual NLU, NLG and coreference annotations using a novel and unified framework of SIMMC conversational acts for both user and assistant utterances.

Finally, we present several tasks within SIMMC as objective evaluation protocols, such as structural API prediction, response generation, and dialog state tracking. We benchmark a collection of existing models on these SIMMC tasks as strong baselines, and demonstrate rich multimodal conversational interactions. Our data, annotations, and models are publicly available.

本文链接：https://paper.nweon.com/8510

Situated and Interactive Multimodal Conversations

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Situated and Interactive Multimodal Conversations

您可能还喜欢...

Evaluating the efficacy of haptic feedback, 360° treadmill-integrated Virtual Reality framework and longitudinal training on decision-making performance in a complex search-and-shoot simulation

The Transformation Method of Vehicle Dynamics Right Hand Coordinate System and Image Left Hand Coordinate System

ThermalPen: Investigating the Influence of Thermal Haptic Feedback for Creativity in 3D Sketching

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘