Improvement of speaker localization by considering multipath interference of sound wave for binaural robot audition

Ui Hyun Kim, Takeshi Mizumoto, Tetsuya Ogata, Hiroshi G. Okuno

研究成果: Conference contribution

8 被引用数 (Scopus)

抄録

This paper presents an improved speaker localization method based on the generalized cross-correlation (GCC) method weighted by the phase transform (PHAT) for binaural robot audition. The problem with the conventional direction-of-arrival (DOA) estimation based on the GCC-PHAT method is a multipath interference whereby a sound wave travels to microphones via the front-head path and the back-head path in binaural robot audition. This paper describes a new time delay factor for the GCC-PHAT method to compensate multipath interference on the assumption of spherical robot head. In addition, the restriction of the time difference of arrival (TDOA) estimation by the sampling frequency is also solved by applying the maximum likelihood (ML) estimation in frequency domain. Experiments conducted in the SIG-2 humanoid robot show that the proposed method reduces localization errors by 17.8 degrees on average and by over 35 degrees in side directions comparing to the conventional DOA estimation.

本文言語English
ホスト出版物のタイトルIROS'11 - 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems
ホスト出版物のサブタイトルCelebrating 50 Years of Robotics
ページ2910-2915
ページ数6
DOI
出版ステータスPublished - 2011 12 29
外部発表はい
イベント2011 IEEE/RSJ International Conference on Intelligent Robots and Systems: Celebrating 50 Years of Robotics, IROS'11 - San Francisco, CA, United States
継続期間: 2011 9 252011 9 30

出版物シリーズ

名前IEEE International Conference on Intelligent Robots and Systems

Conference

Conference2011 IEEE/RSJ International Conference on Intelligent Robots and Systems: Celebrating 50 Years of Robotics, IROS'11
国/地域United States
CitySan Francisco, CA
Period11/9/2511/9/30

ASJC Scopus subject areas

  • 制御およびシステム工学
  • ソフトウェア
  • コンピュータ ビジョンおよびパターン認識
  • コンピュータ サイエンスの応用

フィンガープリント

「Improvement of speaker localization by considering multipath interference of sound wave for binaural robot audition」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル