Enhanced robot speech recognition based on microphone array source separation and missing feature theory

Shun'ichi Yamamoto*, Jean Marc Valin, Kazuhiro Nakadai, Jean Rouat, François Michaud, Tetsuya Ogata, Hiroshi G. Okuno

*この研究の対応する著者

研究成果: Conference contribution

53 被引用数 (Scopus)

抄録

A humanoid robot under real-world environments usually hears mixtures of sounds, and thus three capabilities are essential for robot audition; sound source localization, separation, and recognition of separated sounds. While the first two are frequently addressed, the last one has not been studied so much. We present a system that gives a humanoid robot the ability to localize, separate and recognize simultaneous sound sources. A microphone array is used along with a real-time dedicated implementation of Geometric Source Separation (GSS) and a multi-channel post-filter that gives us a further reduction of interferences from other sources. An automatic speech recognizer (ASR) based on the Missing Feature Theory (MFT) recognizes separated sounds in real-time by generating missing feature masks automatically from the post-filtering step. The main advantage of this approach for humanoid robots resides in the fact that the ASR with a clean acoustic model can adapt the distortion of separated sound by consulting the post-filter feature masks. Recognition rates are presented for three simultaneous speakers located at 2m from the robot. Use of both the post-filter and the missing feature mask results in an average reduction in error rate of 42% (relative).

本文言語English
ホスト出版物のタイトルProceedings of the 2005 IEEE International Conference on Robotics and Automation
ページ1477-1482
ページ数6
DOI
出版ステータスPublished - 2005
外部発表はい
イベント2005 IEEE International Conference on Robotics and Automation - Barcelona, Spain
継続期間: 2005 4月 182005 4月 22

出版物シリーズ

名前Proceedings - IEEE International Conference on Robotics and Automation
2005
ISSN(印刷版)1050-4729

Conference

Conference2005 IEEE International Conference on Robotics and Automation
国/地域Spain
CityBarcelona
Period05/4/1805/4/22

ASJC Scopus subject areas

  • ソフトウェア
  • 制御およびシステム工学
  • 人工知能
  • 電子工学および電気工学

フィンガープリント

「Enhanced robot speech recognition based on microphone array source separation and missing feature theory」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル