Multichannel Speech Enhancement without Beamforming

编辑：映维 | 分类：XR | 2022年4月14日

Note: We don't have the ability to review paper

PubDate: May 2022

Teams: Meta,The Ohio State University

Writers: Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang

PDF: Multichannel Speech Enhancement without Beamforming

Multichannel Speech Enhancement without Beamforming

Abstract

Deep neural networks are often coupled with traditional spatial filters, such as MVDR beamformers for effectively exploiting spatial information. Even though single-stage end-to-end supervised models can obtain impressive enhancement, combining them with a traditional beamformer and a DNN-based post-filter in a multistage processing provides additional improvements. In this work, we propose a two-stage strategy for multi-channel speech enhancement that does not require a traditional beamformer for additional performance. First, we propose a novel attentive dense convolutional network (ADCN) for estimating real and imaginary parts of complex spectrogram. ADCN obtains state-of-the-art results among single-stage models. Next, we use ADCN with a recently proposed triple-path attentive recurrent network (TPARN) for estimating waveform samples. The proposed strategy uses two insights; first, using different approaches in two stages; and second, using a stronger model in the first stage. We illustrate the efficacy of our strategy by evaluating multiple models in a two-stage approach with and without a traditional beamformer.

本文链接：https://paper.nweon.com/11951

Multichannel Speech Enhancement without Beamforming

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Multichannel Speech Enhancement without Beamforming

您可能还喜欢...

Wireless Sensing Data Collection and Processing for Metaverse Avatar Construction

Development of laparoscopic cholecystectomy simulator based on unity game engine

Occlusion Resistant Network for 3D Face Reconstruction

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘