Improvement in listening capability for humanoid robot HRP-2

Toru Takahashi*, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno


研究成果: Conference contribution

6 被引用数 (Scopus)


This paper describes improvement of sound source separation for a simultaneous automatic speech recognition (ASR) system of a humanoid robot. A recognition error in the system is caused by a separation error and interferences of other sources. In separability, an original geometric source separation (GSS) is improved. Our GSS uses a measured robot's head related transfer function (HRTF) to estimate a separation matrix. As an original GSS uses a simulated HRTF calculated based on a distance between microphone and sound source, there is a large mismatch between the simulated and the measured transfer functions. The mismatch causes a severe degradation of recognition performance. Faster convergence speed of separation matrix reduces separation error. Our approach gives a nearer initial separation matrix based on a measured transfer function from an optimal separation matrix than a simulated one. As a result, we expect that our GSS improves the convergence speed. Our GSS is also able to handle an adaptive step-size parameter. These new features are added into open source robot audition software (OSS) called "HARK" which is newly updated as version 1.0.0. The HARK has been installed on a HRP-2 humanoid with an 8-element microphone array. The listening capability of HRP-2 is evaluated by recognizing a target speech signal which is separated from a simultaneous speech signal by three talkers. The word correct rate (WCR) of ASR improves by 5 points under normal acoustic environments and by 10 points under noisy environments. Experimental results show that HARK 1.0.0 improves the robustness against noises.

ホスト出版物のタイトル2010 IEEE International Conference on Robotics and Automation, ICRA 2010
出版ステータスPublished - 2010 8月 26
イベント2010 IEEE International Conference on Robotics and Automation, ICRA 2010 - Anchorage, AK, United States
継続期間: 2010 5月 32010 5月 7


名前Proceedings - IEEE International Conference on Robotics and Automation


Conference2010 IEEE International Conference on Robotics and Automation, ICRA 2010
国/地域United States
CityAnchorage, AK

ASJC Scopus subject areas

  • ソフトウェア
  • 制御およびシステム工学
  • 人工知能
  • 電子工学および電気工学


「Improvement in listening capability for humanoid robot HRP-2」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。