RWC multimodal database for interactions by integration of spoken language and visual information

S. Hayamizu*, O. Hasegawa, K. Itou, K. Sakaue, K. Tanaka, S. Nagaya, M. Nakazawa, T. Endoh, F. Togawa, K. Sakamoto, K. Yamamoto

*Corresponding author for this work

Research output: Paper, peer-reviewed

6 Citations (Scopus)

Abstract

This paper describes our design policy and prototype data collection for the RWC (Real World Computing Program) multimodal database. The database is intended for research and development on the integration of spoken language and visual information for human-computer interaction. The interactions are assumed to use image recognition, image synthesis, speech recognition, and speech synthesis. Visual information also includes non-verbal communication, such as interactions using hand gestures and facial expressions between a human and a human-like CG (Computer Graphics) agent with a face and hands. Based on experiments on interactions in these modes, the specifications of the database are discussed from the viewpoint of controlling the variability and the cost of collection.

Original language: English
Pages: 2171-2174
Number of pages: 4
Publication status: Published - 1996
Externally published: Yes
Event: Proceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4) - Philadelphia, PA, USA
Duration: 3 Oct 1996 to 6 Oct 1996

Other

Other: Proceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4)
City: Philadelphia, PA, USA
Period: 96/10/3 to 96/10/6

ASJC Scopus subject areas

  • General Computer Science
