Efficient speech animation synthesis with vocalic lip shapes

Daisuke Mima, Akinobu Maejima, Shigeo Morishima

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Computer-generated speech animations are commonly seen in video games and movies. Although high-quality facial motions can be created by the hand crafted work of skilled artists, this approach is not always suitable because of time and cost constraints. A data-driven approach [Taylor et al. 2012], such as machine learning to concatenate video portions of speech training data, has been utilized to generate natural speech animation, while a large number of target shapes are often required for synthesis. We can obtain smooth mouth motions from prepared lip shapes for typical vowels by using an interpolation of lip shapes with Gaussian mixture models (GMMs) [Yano et al. 2007]. However, the resulting animation is not directly generated from the measured lip motions of someone's actual speech.

Original languageEnglish
Title of host publicationACM SIGGRAPH 2013 Posters, SIGGRAPH 2013
DOIs
Publication statusPublished - 2013 Aug 21
EventACM Special Interest Group on Computer Graphics and Interactive Techniques Conference, SIGGRAPH 2013 - Anaheim, CA, United States
Duration: 2013 Jul 212013 Jul 25

Publication series

NameACM SIGGRAPH 2013 Posters, SIGGRAPH 2013

Conference

ConferenceACM Special Interest Group on Computer Graphics and Interactive Techniques Conference, SIGGRAPH 2013
CountryUnited States
CityAnaheim, CA
Period13/7/2113/7/25

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition
  • Software

Fingerprint Dive into the research topics of 'Efficient speech animation synthesis with vocalic lip shapes'. Together they form a unique fingerprint.

  • Cite this

    Mima, D., Maejima, A., & Morishima, S. (2013). Efficient speech animation synthesis with vocalic lip shapes. In ACM SIGGRAPH 2013 Posters, SIGGRAPH 2013 [2] (ACM SIGGRAPH 2013 Posters, SIGGRAPH 2013). https://doi.org/10.1145/2503385.2503388