Machine Learning-Based Room Classification for Selecting Binaural Room Impulse Responses in Augmented Reality Applications

编辑：映维 | 分类：Perception / XR | 2022年4月28日

Note: We don't have the ability to review paper

PubDate: November 2021

Teams: TH Köln

Writers: Damian Dziwis; Simon Zimmermann; Tim Lübeck; Johannes M. Arend; David Bau; Christoph Pörschmann

PDF: Machine Learning-Based Room Classification for Selecting Binaural Room Impulse Responses in Augmented Reality Applications

Machine Learning-Based Room Classification for Selecting Binaural Room Impulse Responses in Augmented Reality Applications

Abstract

A key attribute of augmented reality (AR) applications is the matching reverberation of virtual sounds to the room acoustics of the real environment. However, especially in real-time scenarios where the properties of rapidly changing surroundings are unknown, creating a persistently coherent sound field synthesis within a real space is a challenging problem. While AR devices and their sensors can usually provide depth information within the field of view of the user, retrieving a complete geometric model requires significant time and user activity. Prior acoustic measurements or scans of the deployment area also severely limit many use cases, especially in the consumer sector. In this paper, we propose an automatic system that provides a fast selection of room categories and their corresponding binaural reverberation using only monoscopic images as input information. The proposed system combines existing approaches of machine learning (ML) based room classification and parametric synthesis of binaural room impulse responses (BRIRs) to provide room reverberation for arbitrary indoor environments. As a proof of concept, we present a demonstrator developed in Cycling’74s Max linked to a python-based ML model. For the ML model, we use the convolutional neural network (CNN) GoogLeNet architecture trained on a subset of the Places365 data set. This subset contains 20 custom indoor room categories which are composed of the original categories that share similar acoustic properties. The demonstrator captures images and automatically selects binaural reverberation based on the predictions of the ML classifier. Monophonic stimuli are reverberated and presented using dynamic headphone-based binauralization.

本文链接：https://paper.nweon.com/12129

Machine Learning-Based Room Classification for Selecting Binaural Room Impulse Responses in Augmented Reality Applications

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Machine Learning-Based Room Classification for Selecting Binaural Room Impulse Responses in Augmented Reality Applications

您可能还喜欢...

Vertical Field-of-View Extension and Walking Characteristics in Head-Worn Virtual Environments

Estimation of optimal encoding ladders for tiled 360° VR video in adaptive streaming systems

LinkGlide-S: A Wearable Multi-Contact Tactile Display Aimed at Rendering Object Softness at the Palm with Impedance Control in VR and Telemanipulation

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘