
Points2Sound: From mono to binaural audio using 3D point cloud scenes

Note: We don't have the ability to review papers.

PubDate: Apr 2021

Teams: University of Music and Performing Arts Vienna

Writers: Francesc Lluís, Vasileios Chatziioannou, Alex Hofmann

PDF: Points2Sound: From mono to binaural audio using 3D point cloud scenes

Abstract

Binaural sound that matches its visual counterpart is crucial for bringing meaningful and immersive experiences to people in augmented reality (AR) and virtual reality (VR) applications. Recent works have shown the possibility of generating binaural audio from mono using 2D visual information as guidance. Using 3D visual information may allow for a more accurate representation of a virtual audio scene for VR/AR applications. This paper proposes Points2Sound, a multi-modal deep learning model which generates a binaural version from mono audio using 3D point cloud scenes. Specifically, Points2Sound consists of a vision network which extracts visual features from the point cloud scene to condition an audio network, which operates in the waveform domain, to synthesize the binaural version. Both quantitative and perceptual evaluations indicate that our proposed model is preferred over a reference case based on a recent 2D mono-to-binaural model.
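To make the described architecture concrete, here is a minimal PyTorch sketch of the conditioning pattern the abstract outlines: a vision network reduces the 3D point cloud to a feature vector, which conditions a waveform-domain audio network that maps a mono input to a 2-channel (left/right) binaural output. This is not the authors' implementation; the point-wise MLP encoder, the FiLM-style multiplicative conditioning, all layer sizes, and the 6-dimensional (xyz + rgb) point format are illustrative assumptions.

```python
# Sketch only: stands in for Points2Sound's networks, which are larger
# and differently structured. Shapes and layer choices are assumptions.
import torch
import torch.nn as nn


class PointCloudEncoder(nn.Module):
    """Encodes an (N, 6) point cloud (xyz + rgb) into one visual feature vector."""

    def __init__(self, feat_dim: int = 128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(6, 64), nn.ReLU(),
            nn.Linear(64, 128), nn.ReLU(),
            nn.Linear(128, feat_dim),
        )

    def forward(self, points: torch.Tensor) -> torch.Tensor:
        # points: (batch, num_points, 6) -> per-point features, then max-pool
        per_point = self.mlp(points)
        return per_point.max(dim=1).values  # (batch, feat_dim)


class ConditionedAudioNet(nn.Module):
    """Waveform-domain encoder/decoder; the visual feature scales the bottleneck."""

    def __init__(self, feat_dim: int = 128, channels: int = 64):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv1d(1, channels, kernel_size=15, stride=4, padding=7), nn.ReLU(),
            nn.Conv1d(channels, channels, kernel_size=15, stride=4, padding=7), nn.ReLU(),
        )
        # FiLM-style conditioning (an assumption): map the visual feature
        # to one multiplicative scale per audio channel.
        self.film = nn.Linear(feat_dim, channels)
        self.decoder = nn.Sequential(
            nn.ConvTranspose1d(channels, channels, kernel_size=16, stride=4, padding=6), nn.ReLU(),
            nn.ConvTranspose1d(channels, 2, kernel_size=16, stride=4, padding=6),
        )

    def forward(self, mono: torch.Tensor, visual_feat: torch.Tensor) -> torch.Tensor:
        # mono: (batch, 1, samples); visual_feat: (batch, feat_dim)
        h = self.encoder(mono)
        scale = self.film(visual_feat).unsqueeze(-1)  # (batch, channels, 1)
        h = h * scale  # condition audio features on the visual scene
        return self.decoder(h)  # (batch, 2, samples): left / right


if __name__ == "__main__":
    vision, audio = PointCloudEncoder(), ConditionedAudioNet()
    cloud = torch.randn(1, 2048, 6)   # one scene: 2048 points, xyz + rgb
    mono = torch.randn(1, 1, 16384)   # a short mono waveform
    binaural = audio(mono, vision(cloud))
    print(binaural.shape)             # torch.Size([1, 2, 16384])
```

Max-pooling over points makes the visual feature invariant to point ordering, which is why point-cloud encoders typically reduce to a single vector this way before it is used for conditioning.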
