Speech recognition technology combined with three dimensional lip movement

K. Komiya, R. Ishikawa, K. Momose

研究成果: Conference article査読

抄録

In order to improve speech recognition efficiency under background noise such as out of doors, we propose a new recognition technology combined with three dimensional lip movements. In this paper firstly, three-dimensional movements at four positions of the mouth were measured using principal component analysis to clarify which positions and which directions are main contributors to pronunciation. Secondly, recognition evaluation tests for 50 Japanese words were carried out under noise levels ranging from 40 to 80 dB. In the experiment, over 80% of recognition efficiency was measured at 70dB and improvement of 40% was obtained compared with ordinary speech recognition. From the experimental results, the proposed method can be modified to be used as practical speech recognition technology. Finally, research subjects were picked up such as an improvement in precision of measuring lip movement and experiments and data collection out of doors.

本文言語English
ページ(範囲)95-102
ページ数8
ジャーナルProceedings of SPIE - The International Society for Optical Engineering
4298
DOI
出版ステータスPublished - 2001 1 1
外部発表はい
イベントThree-Dimensional Image Capture and Applications IV - San Jose, CA, United States
継続期間: 2001 1 242001 1 25

ASJC Scopus subject areas

  • 電子材料、光学材料、および磁性材料
  • 凝縮系物理学
  • コンピュータ サイエンスの応用
  • 応用数学
  • 電子工学および電気工学

フィンガープリント

「Speech recognition technology combined with three dimensional lip movement」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル