Spatial Covariance Matrix Estimation for Reverberant Speech with Application to Speech Enhancement

Editor: 映维 | Category: XR | December 29, 2020

Note: We don't have the ability to review papers.

PubDate: October 25, 2020

Teams: Ben-Gurion University of the Negev; Facebook Reality Labs

Writers: Ran Weisman, Vladimir Tourbabin, Paul Calamia, Boaz Rafaely

PDF: Spatial Covariance Matrix Estimation for Reverberant Speech with Application to Speech Enhancement


Abstract

A wide range of applications in speech and audio signal processing incorporate a model of room reverberation based on the spatial covariance matrix (SCM). Typically, a diffuse sound field model is used, but although the diffuse model simplifies formulations, it may lead to limited accuracy in realistic sound fields, resulting in potential degradation in performance. While some extensions to the diffuse field SCM have recently been presented, accurate modeling for real sound fields remains an open problem. In this paper, a method for estimating the SCM of reverberant speech is proposed, based on the selection of time-frequency bins dominated by reverberation. The method is data-based and estimates the SCM for a specific acoustic scene. It is therefore applicable to realistic reverberant fields. An application of the proposed method to optimal beamforming for speech enhancement is presented, using the plane wave density function in the spherical harmonics (SH) domain. It is shown that the use of the proposed SCM outperforms the commonly used diffuse field SCM, suggesting the method is more successful in capturing the statistics of the late part of the reverberation.
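To make the idea of a data-based SCM concrete, the following is a minimal sketch of a per-frequency sample covariance averaged over a chosen subset of time-frequency bins. It is not the authors' implementation: the abstract does not state how reverberation-dominated bins are selected, so the boolean `mask` is assumed to be supplied by some external selection step, and the function name `estimate_scm` and the array layout are hypothetical choices made for illustration.

```python
import numpy as np


def estimate_scm(stft, mask):
    """Sample spatial covariance matrix per frequency from selected bins.

    stft: complex array of shape (channels, freqs, frames), e.g. a
          multichannel STFT (assumed layout).
    mask: boolean array of shape (freqs, frames); True marks bins that are
          assumed to be dominated by reverberation. How to obtain this mask
          is outside the scope of this sketch.
    Returns a complex array of shape (freqs, channels, channels).
    """
    n_ch, n_freq, _ = stft.shape
    scm = np.zeros((n_freq, n_ch, n_ch), dtype=complex)
    for f in range(n_freq):
        # Keep only the frames selected for this frequency.
        x = stft[:, f, :][:, mask[f]]  # shape (channels, n_selected)
        if x.shape[1] == 0:
            continue  # no selected bins at this frequency: leave zeros
        # Average of outer products x x^H over the selected frames.
        scm[f] = x @ x.conj().T / x.shape[1]
    return scm
```

Each returned matrix is Hermitian and positive semidefinite by construction. In practice some regularization (for example diagonal loading) would likely be needed before inverting it in a beamformer, particularly when few bins are selected at a given frequency.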

Link to this article: https://paper.nweon.com/8498

