Recognition of para-linguistic information and its application to spoken dialogue system

Shinya Fujie, Yasushi Ejiri, Yosuke Matsusaka, Hideaki Kikuchi, Tetsunori Kobayashi

研究成果: Conference contribution

10 引用 (Scopus)

抜粋

The human-human interactions in a spoken dialogue seem to use not only linguistic information in the utterances but also some sorts of additional information supporting linguistic information. We call these sorts of additional information "para-linguistic information". In this paper, we present a recognition method of attitudes by prosodic information, and a recognition method of head gestures. In the former method, in order to recognize two attitudes, such as "positive" and "negative", F0 pattern and phoneme alignment are introduced as features. In the latter method, in order to recognize three gestures, such as "nod", "tilt" and "shake", left-to-right HMM is introduced as the probabilistic model as well as optical flow is introduced as features. Experiment results show that these methods are sufficient to recognize user's attitude as para-linguistic information. Finally, we show a proto-type spoken dialogue system using para-linguistic information and how these sorts of information contribute the efficient conversation.

元の言語English
ホスト出版物のタイトル2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
出版者Institute of Electrical and Electronics Engineers Inc.
ページ231-236
ページ数6
ISBN(電子版)0780379802, 9780780379800
DOI
出版物ステータスPublished - 2003 1 1
イベントIEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003 - St. Thomas, United States
継続期間: 2003 11 302003 12 4

出版物シリーズ

名前2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003

Other

OtherIEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
United States
St. Thomas
期間03/11/3003/12/4

    フィンガープリント

ASJC Scopus subject areas

  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

これを引用

Fujie, S., Ejiri, Y., Matsusaka, Y., Kikuchi, H., & Kobayashi, T. (2003). Recognition of para-linguistic information and its application to spoken dialogue system. : 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003 (pp. 231-236). [1318446] (2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ASRU.2003.1318446