Efficient speech animation synthesis with vocalic lip shapes

Daisuke Mima*, Akinobu Maejima, Shigeo Morishima

*Corresponding author for this work

Research output: Conference contribution

Abstract

Computer-generated speech animations are commonly seen in video games and movies. Although high-quality facial motions can be created by the handcrafted work of skilled artists, this approach is not always suitable because of time and cost constraints. A data-driven approach [Taylor et al. 2012], such as machine learning to concatenate video portions of speech training data, has been utilized to generate natural speech animation, but a large number of target shapes is often required for synthesis. We can obtain smooth mouth motions from prepared lip shapes for typical vowels by using an interpolation of lip shapes with Gaussian mixture models (GMMs) [Yano et al. 2007]. However, the resulting animation is not directly generated from the measured lip motions of someone's actual speech.

Original language: English
Title of host publication: ACM SIGGRAPH 2013 Posters, SIGGRAPH 2013
DOI
Publication status: Published - 21 Aug 2013
Event: ACM Special Interest Group on Computer Graphics and Interactive Techniques Conference, SIGGRAPH 2013 - Anaheim, CA, United States
Duration: 21 Jul 2013 to 25 Jul 2013

Publication series

Name: ACM SIGGRAPH 2013 Posters, SIGGRAPH 2013

Conference

Conference: ACM Special Interest Group on Computer Graphics and Interactive Techniques Conference, SIGGRAPH 2013
Country/Territory: United States
City: Anaheim, CA
Period: 21 Jul 2013 to 25 Jul 2013

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition
  • Software
