Intelligent facial image coding driven by speech and phoneme

Shigeo Morishima, Kiyoharu Aizawa, Hiroshi Harashima

研究成果: Conference article査読

40 被引用数 (Scopus)


The authors propose and compare two types of model-based facial motion coding schemes, i.e., synthesis by rules and synthesis by parameters. In synthesis by rules, facial motion images are synthesized on the basis of rules extracted by analysis of training image samples that include all of the phonemes and coarticulation. This system can be utilized as an automatic facial animation synthesizer from text input or as a man-machine interface using the facial motion image. In synthesis by parameters, facial motion images are synthesized on the basis of a code word index of speech parameters. Experimental results indicate good performance for both systems, which can create natural facial-motion images with very low transmission rate. Details of 3-D modeling, algorithm synthesis, and performance are discussed.

ジャーナルICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
出版ステータスPublished - 1989 12 1
イベント1989 International Conference on Acoustics, Speech, and Signal Processing - Glasgow, Scotland
継続期間: 1989 5 231989 5 26

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

フィンガープリント 「Intelligent facial image coding driven by speech and phoneme」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。