loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: J. D. Edge and A. Hilton

Affiliation: Centre for Vision, Speech and Signal Processing, School of Electronic and Physical Sciences, University of Surrey, United Kingdom

Keyword(s): Facial Animation, Speech Synthesis, Virtual Humans.

Related Ontology Subjects/Areas/Topics: Animation and Simulation ; Character Animation ; Computer Vision, Visualization and Computer Graphics ; Facial Animation

Abstract: Data-driven approaches to 2D facial animation from video have achieved highly realistic results. In this paper we introduce a process for visual speech synthesis from 3D video capture to reproduce the dynamics of 3D face shape and appearance. Animation from real speech is performed by path optimisation over a graph representation of phonetically segmented captured 3D video. A novel similarity metric using a hierarchical wavelet decomposition is presented to identify transitions between 3D video frames without visual artifacts in facial shape, appearance or dynamics. Face synthesis is performed by playing back segments of the captured 3D video to accurately reproduce facial dynamics. The framework allows visual speech synthesis from captured 3D video with minimal user intervention. Results are presented for synthesis from a database of 12minutes (18000 frames) of 3D video which demonstrate highly realistic facial animation.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.118.164.227

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
D. Edge, J. and Hilton, A. (2007). VISUAL SPEECH SYNTHESIS FROM 3D VIDEO. In Proceedings of the Second International Conference on Computer Graphics Theory and Applications (VISIGRAPP 2007) - Volume 1: GRAPP; ISBN 978-972-8865-72-6; ISSN 2184-4321, SciTePress, pages 57-62. DOI: 10.5220/0002080400570062

@conference{grapp07,
author={J. {D. Edge} and A. Hilton},
title={VISUAL SPEECH SYNTHESIS FROM 3D VIDEO},
booktitle={Proceedings of the Second International Conference on Computer Graphics Theory and Applications (VISIGRAPP 2007) - Volume 1: GRAPP},
year={2007},
pages={57-62},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002080400570062},
isbn={978-972-8865-72-6},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the Second International Conference on Computer Graphics Theory and Applications (VISIGRAPP 2007) - Volume 1: GRAPP
TI - VISUAL SPEECH SYNTHESIS FROM 3D VIDEO
SN - 978-972-8865-72-6
IS - 2184-4321
AU - D. Edge, J.
AU - Hilton, A.
PY - 2007
SP - 57
EP - 62
DO - 10.5220/0002080400570062
PB - SciTePress