Multi-Channel Speech Enhancement Using Graph Neural Networks

编辑：映维 | 分类：XR | 2021年5月12日

Note: We don't have the ability to review paper

PubDate: June 6, 2021

Teams: Facebook Reality Labs Research

Writers: Panagiotis Tzirakis, Anurag Kumar, Jacob Donley

PDF: Multi-Channel Speech Enhancement Using Graph Neural Networks

Multi-Channel Speech Enhancement Using Graph Neural Networks

Abstract

Multi-channel speech enhancement aims to extract clean speech from a noisy mixture using signals captured from multiple microphones. Recently proposed methods tackle this problem by incorporating deep neural network models with spatial filtering techniques such as the minimum variance distortionless response (MVDR) beamformer. In this paper, we introduce a different research direction by viewing each audio channel as a node lying in a non-Euclidean space and, specifically, a graph. This formulation allows us to apply graph neural networks (GNN) to find spatial correlations among the different channels (nodes). We utilize graph convolution networks (GCN) by incorporating them in the embedding space of a U-Net architecture. We use LibriSpeech dataset and simulate room acoustics data to extensively experiment with our approach using different array types, and number of microphones. Results indicate the superiority of our approach when compared to prior state-of-the-art method.

本文链接：https://paper.nweon.com/9870

Multi-Channel Speech Enhancement Using Graph Neural Networks

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Multi-Channel Speech Enhancement Using Graph Neural Networks

您可能还喜欢...

Batmen X The Puzzler - Escaping AR’s Drawbacks with Augmented Virtuality and Low Cost Sensors

The RobotriX: An eXtremely Photorealistic and Very-Large-Scale Indoor Dataset of Sequences with Robot Trajectories and Interactions

Self-position awareness-based presence and interaction in virtual reality

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘