Voice separation with an unknown number of multiple speakers

编辑：映维 | 分类：XR | 2020年7月13日

Note: We don't have the ability to review paper

PubDate: July 10, 2020

Teams: Facebook AI；Tel-Aviv University

Writers: Eliya Nachmani；Yossi Adi；Lior Wolf

PDF: Voice separation with an unknown number of multiple speakers

Voice separation with an unknown number of multiple speakers

Abstract

We present a new method for separating a mixed audio sequence, in which multiple voices speak
simultaneously. The new method employs gated neural networks that are trained to separate the
voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest
number of speakers is employed to select the actual number of speakers in a given sample. Our
method greatly outperforms the current state of the art, which, as we show, is not competitive for
more than two speakers.

本文链接：https://paper.nweon.com/3752

Voice separation with an unknown number of multiple speakers

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Voice separation with an unknown number of multiple speakers

您可能还喜欢...

The Effect of Task on Visual Attention in Interactive Virtual Environments

Infrastructure-based Multi-Camera Calibration using Radial Projections

Digital Texture Voxels for Stretchable Morphing Skin Applications

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘