Real-time facial action image synthesis system driven by speech and text

Shigeo Morishima, Kiyoharu Aizawa, Hiroshi Harashima

研究成果: Conference contribution

12 被引用数 (Scopus)


Automatic facial motion image synthesis schemes and a real-time system design are presented. The purpose of this scheme is to realize an intelligent human-machine interface or intelligent communication system with talking head images. A human face is reconstructed with 3D surface model and texture mapping technique on the display of terminal. Facial motion images are synthesized naturally by transformation of the lattice points on wire frames. Two types of motion drive methods, text to image conversion and speech to image conversion are proposed in this paper. In the former manner, the synthesized head can speak some given texts naturally and in the latter case, some mouth and jaw motions can be synthesized in time to speech signal of behind speaker. These schemes were implemented to a parallel image computer and a real-time image synthesizer could output facial motion images to the display as fast as video rate.

ホスト出版物のタイトルProceedings of SPIE - The International Society for Optical Engineering
編集者Murat Kunt
出版社Publ by Int Soc for Optical Engineering
出版ステータスPublished - 1990 12 1
イベントVisual Communications and Image Processing '90 - Lausanne, Switz
継続期間: 1990 10 11990 10 4


名前Proceedings of SPIE - The International Society for Optical Engineering
1360 pt 2


OtherVisual Communications and Image Processing '90
CityLausanne, Switz

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

フィンガープリント 「Real-time facial action image synthesis system driven by speech and text」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。