Extracting Specific Voice from Mixed Audio Source

编辑：映维 | 分类：XR | 2020年12月4日

Note: We don't have the ability to review paper

PubDate: December 2019

Teams: LINE Corporation

Writers: Kunihiko Sato

PDF: Extracting Specific Voice from Mixed Audio Source

Extracting Specific Voice from Mixed Audio Source

Abstract

We propose auditory diminished reality by a deep neural network (DNN) extracting a single speech signal from a mixture of sounds containing other speakers and background noise. To realize the proposed DNN, we introduce a new dataset comprised of multi-speakers and environment noises. We conduct evaluations for measuring the source separation quality of the DNN. Additionally, we compare the separation quality of models learned with different amounts of training data. As a result, we found there is no significant difference in the separation quality between 10 and 30 minutes of the target speaker’s speech length for training data.

本文链接：https://paper.nweon.com/8341

Extracting Specific Voice from Mixed Audio Source

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Extracting Specific Voice from Mixed Audio Source

您可能还喜欢...

Pre-Calibrated Visuo-Haptic Co-Location Improves Execution in Virtual Environments

Long-Term Visual Localization with Semantic Enhanced Global Retrieval

Camera-Based Selection with Cardboard HMDs

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘