Robust recognition of simultaneous speech by a mobile robot

Jean Marc Valin, Shun'ichi Yamamoto, Jean Rouat, François Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno

研究成果: Article査読

62 被引用数 (Scopus)

抄録

This paper describes a system that gives a mobile robot the ability to perform automatic speech recognition with simultaneous speakers. A microphone array is used along with a real-time implementation of geometric source separation (GSS) and a postfilter that gives a further reduction of interference from other sources. The postfilter is also used to estimate the reliability of spectral features and compute a missing feature mask. The mask is used in a missing feature theory-based speech recognition system to recognize the speech from simultaneous Japanese speakers in the context of a humanoid robot. Recognition rates are presented for three simultaneous speakers located at 2 m from the robot. The system was evaluated on a 200-word vocabulary at different azimuths between sources, ranging from 10° to 90°. Compared to the use of the microphone array source separation alone, we demonstrate an average reduction in relative recognition error rate of 24% with the postfilter and of 42% when the missing features approach is combined with the postfilter. We demonstrate the effectiveness of our multisource microphone array postfilter and the improvement it provides when used in conjunction with the missing features theory.

本文言語English
ページ(範囲)742-752
ページ数11
ジャーナルIEEE Transactions on Robotics
23
4
DOI
出版ステータスPublished - 2007 8
外部発表はい

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Electrical and Electronic Engineering

フィンガープリント 「Robust recognition of simultaneous speech by a mobile robot」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル