Model-based talking face synthesis for anthropomorphic spoken dialog agent system

Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura

研究成果: Paper

6 引用 (Scopus)

抜粋

Towards natural human-machine communication, interface technologies by way of speech and image information have been intensively developed. An anthropomorphic dialog agent is an ideal system, which integrates spoken dialog and natural facial expressions. This paper reports on our project aiming to create a general-purpose toolkit for building an easily customizable anthropomorphic agent. There have been almost no tools so far such as intuitive, easy to understand, fully interactive, and open source. Our anthropomorphic agent is designed to fulfill these requirements. This toolkit consists four modules, multi modal dialog integration, speech recognition, speech synthesis, and face image synthesis. These modules are highly modularized and interlinked by a simple communication protocols. In this paper, we focus on the construction of an agent's face image synthesis. For this part lip movement control synchronous to the speech signal and facial emotion expression are the most important parts. We developed the face image synthesis module (FSM) that only requires one frontal face image, and can be used by any skill level of users. A user's original agent can be generated by easy adjustment of the frontal face image and the generic wire-frame model. The paper describes overall system diagram and specifically the agent's face image synthesis part.

元の言語English
ページ351-354
ページ数4
DOI
出版物ステータスPublished - 2003 1 1
外部発表Yes
イベント2003 Multimedia Conference - Proceedings of the 11th ACM International Conference on Multimedia, MM'03 - Berkeley, CA., United States
継続期間: 2003 11 42003 11 6

Conference

Conference2003 Multimedia Conference - Proceedings of the 11th ACM International Conference on Multimedia, MM'03
United States
Berkeley, CA.
期間03/11/403/11/6

ASJC Scopus subject areas

  • Computer Science(all)

フィンガープリント Model-based talking face synthesis for anthropomorphic spoken dialog agent system' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Yotsukura, T., Morishima, S., & Nakamura, S. (2003). Model-based talking face synthesis for anthropomorphic spoken dialog agent system. 351-354. 論文発表場所 2003 Multimedia Conference - Proceedings of the 11th ACM International Conference on Multimedia, MM'03, Berkeley, CA., United States. https://doi.org/10.1145/957013.957089