Comic-Guided Speech Synthesis

小编映维 | 分类：XR | 2020年5月13日

Note: We don't have the ability to review paper

PubDate: Nov 2019

Teams: Beijing Institute of Technology 2Inception Institute of Artificial Intelligence 3George Mason University

Writers: Yujia Wang1 Wenguan Wang1,2 Wei Liang1 Lap-Fai Yu

PDF: Comic-Guided Speech Synthesis

Project: Comic-Guided Speech Synthesis

Comic-Guided Speech Synthesis

Abstract

We introduce a novel approach for synthesizing realistic speeches for comics. Using a comic page as input, our approach synthesizes speeches for each comic character following the reading flow. It adopts a cascading strategy to synthesize speeches in two stages: Comic Visual Analysis and Comic Speech Synthesis. In the first stage, the input comic page is analyzed to identify the gender and age of the characters, as well as texts each character speaks and corresponding emotion. Guided by this analysis, in the second stage, our approach synthesizes realistic speeches for each character, which are consistent with the visual observations. Our experiments show that the proposed approach can synthesize realistic and lively speeches for different types of comics. Perceptual studies performed on the synthesis results of multiple sample comics validate the efficacy of our approach.

本文链接：https://paper.nweon.com/1014

Comic-Guided Speech Synthesis

您可能还喜欢...

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘

Comic-Guided Speech Synthesis

您可能还喜欢...

GlideReality: a highly immersive VR System augmented by a novel multi-modal and multi-contact cutaneous wearable display

ARWalker: A Virtual Walking Companion Application

Mapping Eye Vergence Angle to the Depth of Real and Virtual Objects as an Objective Measure of Depth Perception

最新AR/VR行业分享

最新AR/VR专利

最新AR/VR行业招聘