Man-machine interaction using a vision system with dual viewing angles

Ying Jieh Huang, Hiroshi Dohi, Mitsurti Ishizuka

研究成果: Article査読

1 被引用数 (Scopus)

抄録

This paper describes a vision system with dual viewing angles, i.e., wide and narrow viewing angles, and a scheme of user-friendly speech dialogue environment based on the vision system. The wide viewing angle provides a wide viewing field for wide range motion tracking, and the narrow viewing angle is capable of following a target in wide viewing field to take the image of the target with sufficient resolution. For a fast and robust motion tracking, modified motion energy (MME) and existence energy (££) arc defined to detect the motion of the target and extract the motion region at the same time. Instead of using a physical device such as a foot switch commonly used in speech dialogue systems, the begin/end of an utterance is detected from the movement of user's mouth in our system. Without recognizing the movement of lips directly, the shape variation of the region between lips is tracked for more stable recognition of the span of a dialogue. The tracking speed is about 10 frames/sec when no recognition is performed and about 5 frames/sec when both tracking and recognition are performed without using any special hardware.

本文言語English
ページ(範囲)1074-1083
ページ数10
ジャーナルIEICE Transactions on Information and Systems
E80-D
11
出版ステータスPublished - 1997
外部発表はい

ASJC Scopus subject areas

  • Information Systems
  • Computer Graphics and Computer-Aided Design
  • Software

フィンガープリント 「Man-machine interaction using a vision system with dual viewing angles」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル