Improvement in listening capability for humanoid robot HRP-2

Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

研究成果: Conference contribution

6 引用 (Scopus)

抜粋

This paper describes improvement of sound source separation for a simultaneous automatic speech recognition (ASR) system of a humanoid robot. A recognition error in the system is caused by a separation error and interferences of other sources. In separability, an original geometric source separation (GSS) is improved. Our GSS uses a measured robot's head related transfer function (HRTF) to estimate a separation matrix. As an original GSS uses a simulated HRTF calculated based on a distance between microphone and sound source, there is a large mismatch between the simulated and the measured transfer functions. The mismatch causes a severe degradation of recognition performance. Faster convergence speed of separation matrix reduces separation error. Our approach gives a nearer initial separation matrix based on a measured transfer function from an optimal separation matrix than a simulated one. As a result, we expect that our GSS improves the convergence speed. Our GSS is also able to handle an adaptive step-size parameter. These new features are added into open source robot audition software (OSS) called "HARK" which is newly updated as version 1.0.0. The HARK has been installed on a HRP-2 humanoid with an 8-element microphone array. The listening capability of HRP-2 is evaluated by recognizing a target speech signal which is separated from a simultaneous speech signal by three talkers. The word correct rate (WCR) of ASR improves by 5 points under normal acoustic environments and by 10 points under noisy environments. Experimental results show that HARK 1.0.0 improves the robustness against noises.

元の言語English
ホスト出版物のタイトル2010 IEEE International Conference on Robotics and Automation, ICRA 2010
ページ470-475
ページ数6
DOI
出版物ステータスPublished - 2010 8 26
イベント2010 IEEE International Conference on Robotics and Automation, ICRA 2010 - Anchorage, AK, United States
継続期間: 2010 5 32010 5 7

出版物シリーズ

名前Proceedings - IEEE International Conference on Robotics and Automation
ISSN(印刷物)1050-4729

Conference

Conference2010 IEEE International Conference on Robotics and Automation, ICRA 2010
United States
Anchorage, AK
期間10/5/310/5/7

    フィンガープリント

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Artificial Intelligence
  • Electrical and Electronic Engineering

これを引用

Takahashi, T., Nakadai, K., Komatani, K., Ogata, T., & Okuno, H. G. (2010). Improvement in listening capability for humanoid robot HRP-2. : 2010 IEEE International Conference on Robotics and Automation, ICRA 2010 (pp. 470-475). [5509830] (Proceedings - IEEE International Conference on Robotics and Automation). https://doi.org/10.1109/ROBOT.2010.5509830