Speech recognition for a humanoid with motor noise utilizing missing feature theory

Yoshitaka Nishimura*, Mitsuru Ishizuka, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino

*Corresponding author for this work

Research output: Conference contribution

13 Citations (Scopus)

Abstract

Automatic speech recognition (ASR) is essential for human-humanoid communication. One of the main problems with ASR is that a humanoid inevitably generates motor noises. These noises are easily captured by the humanoid's microphones because the noise sources are closer to the microphones than the target speech source. Thus, the signal-to-noise ratio (SNR) of the input speech becomes quite low (sometimes less than 0 dB). However, it is possible to estimate these noises by using information about the humanoid's own motions and gestures. In this paper we propose a method to improve ASR for a humanoid with motor noises by utilizing the information about the humanoid's motions and gestures. The method consists of psychologically inspired noise suppression and missing-feature-theory-based ASR (MFT-ASR). The proposed noise suppression technique adds white noise after noise suppression, which does not improve the SNR but is well suited to MFT-ASR. This is inspired by the fact that noise addition sometimes helps human perception, as described in Gestalt psychology. MFT-ASR improves ASR by masking unreliable acoustic features in the input sound. The information obtained on motions and gestures is used for estimating the reliability of acoustic features in MFT-ASR. We evaluated the proposed method with noisy speech recorded by Honda ASIMO in a room with reverberation. The noise data contained 32 kinds of noises: motor noises without motions, gesture noises, walking noises, and so on. The experimental results show that the proposed method outperforms the conventional multi-condition training technique.
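The abstract gives no formulas, so the following is only a minimal sketch of the general idea behind missing-feature masking, not the authors' implementation. It assumes a per-feature noise power estimate is already available (in the paper this estimate comes from the humanoid's motion and gesture information), uses a hard 0/1 mask, and scores a frame with a diagonal Gaussian over the reliable features only. All function names, the threshold, and the example numbers are hypothetical.

```python
import math


def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in decibels."""
    return 10.0 * math.log10(signal_power / noise_power)


def reliability_mask(observed_power, estimated_noise_power, threshold_db=0.0):
    """Hard mask: 1 where the estimated local SNR exceeds threshold_db, else 0.

    Assumption: speech power is approximated as observed minus estimated noise.
    """
    mask = []
    for obs, noise in zip(observed_power, estimated_noise_power):
        speech = max(obs - noise, 1e-12)  # floor avoids log of zero or negatives
        mask.append(1 if snr_db(speech, noise) > threshold_db else 0)
    return mask


def masked_log_likelihood(features, mask, means, variances):
    """Diagonal-Gaussian log-likelihood summed over reliable features only.

    Features with mask 0 are skipped, so they cannot affect the score.
    """
    total = 0.0
    for x, m, mu, var in zip(features, mask, means, variances):
        if m:
            total += -0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)
    return total


# Hypothetical frame with three feature channels; the last two are noise-dominated.
observed = [4.0, 1.0, 9.0]
noise_estimate = [1.0, 0.9, 8.5]
mask = reliability_mask(observed, noise_estimate)
score = masked_log_likelihood(observed, mask, [4.0, 0.0, 0.0], [1.0, 1.0, 1.0])
print(mask, score)
```

With a hard mask, a feature swamped by motor noise contributes nothing to the acoustic score, whatever its value. Soft masks with reliabilities between 0 and 1 are a common alternative; the abstract does not say which variant the paper uses.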

Original language: English
Title of host publication: Proceedings of the 2006 6th IEEE-RAS International Conference on Humanoid Robots, HUMANOIDS
Pages: 26-33
Number of pages: 8
Publication status: Published - 2006
Externally published: Yes
Event: 2006 6th IEEE-RAS International Conference on Humanoid Robots, HUMANOIDS - Genoa
Duration: 2006-12-04 to 2006-12-06

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Electrical and Electronic Engineering
