Assessment of general applicability of robot audition system by recognizing three simultaneous speeches

Shun'ichi Yamamoto*, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno

*この研究の対応する著者

研究成果: Conference contribution

21 被引用数 (Scopus)

抄録

Robot audition is a critical technology in creating an intelligent robot operating in daily environments. We have developed such a robot audition system by using a new interface between sound source separation and automatic speech recognition (ASR). A mixture of speeches captured with a pair of microphones installed in the ear positions of a humanoid is separated into each speech by using active direction-pass filter (ADPF). The ADPF extracts a sound source originating from a specific direction in real-time by using interaural phase and intensity differences. The separated speech is recognized by a speech recognizer based on the missing feature theory (MFT). By using a missing feature mask, the MFT based ASR neglects distorted and missing features caused during the speech separation. A missing feature mask for each separated speech is generated in speech separation and is sent to the ASR with the separated speech. Thus, this new integration improves the performance of ASR. However, the generality of this robot audition system has not been assessed so far. In this paper, we assess its general applicability by implementing it on the three humanoids, i.e., ASIMO of Honda, SIG2, and Replie of Kyoto University. By using three simultaneous speeches as benchmarks, the robot audition system improved the performance of ASR over 50% in every humanoid, and thus its general applicability was confirmed.

本文言語English
ホスト出版物のタイトル2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
ページ2111-2116
ページ数6
3
出版ステータスPublished - 2004
外部発表はい
イベント2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) - Sendai
継続期間: 2004 9月 282004 10月 2

Other

Other2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
CitySendai
Period04/9/2804/10/2

ASJC Scopus subject areas

  • 工学(全般)

フィンガープリント

「Assessment of general applicability of robot audition system by recognizing three simultaneous speeches」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル