Exploiting auditory fovea in humanoid-human interaction

Kazuhiro Nakadai*, Hiroshi G. Okuno, Hiroaki Kitano

*Corresponding author for this work

Research output: Conference contribution

14 Citations (Scopus)

Abstract

A robot's auditory perception of the real world should be able to cope with motor and other noises caused by the robot's own movements in addition to environment noises and reverberation. This paper presents the active direction-pass filter (ADPF) that separates sounds originating from a specified direction detected by a pair of microphones. Thus the ADPF is based on directional processing - a process used in visual processing. The ADPF is implemented by hierarchical integration of visual and auditory processing with hypothetical reasoning of interaural phase difference (IPD) and interaural intensity difference (IID) for each sub-band. The ADPF gives differences in resolution in sound localization and separation depending on where the sound comes from: the resolving power is much higher for sounds coming directly from the front of the humanoid than for sounds coming from the periphery. This directional resolving property is similar to that of the eye whereby the visual fovea at the center of the retina is capable of much higher resolution than is the periphery of the retina. To exploit the corresponding "auditory fovea", the ADPF controls the direction of the head. The human tracking and sound source separation based on the ADPF is implemented on the upper-torso of the humanoid and runs in real-time using distributed processing by 5 PCs networked via a gigabit ethernet. The signal-to-noise ratio (SNR) and noise reduction ratio of each sound separated by the ADPF from a mixture of two or three speeches of the same volume were increased by about 2.2 dB and 9 dB, respectively.
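The direction-pass idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simple free-field, far-field model for the expected IPD, uses only IPD (IID, the visual integration, and head control are omitted), and restricts itself to sub-bands below the phase-ambiguity limit. The constants `MIC_DISTANCE` and `SPEED_OF_SOUND`, the helper names, and the linear pass-range rule (narrow at the front, wider toward the periphery) are all hypothetical choices made for illustration.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s (assumed)
MIC_DISTANCE = 0.18     # m, hypothetical spacing between the two microphones


def expected_ipd(freqs_hz, theta_deg):
    """Free-field IPD in radians for a far-field source.

    Azimuth 0 is straight ahead; positive azimuth is toward the left microphone.
    """
    theta = np.clip(theta_deg, -90.0, 90.0)
    delay = MIC_DISTANCE * np.sin(np.deg2rad(theta)) / SPEED_OF_SOUND
    return 2.0 * np.pi * np.asarray(freqs_hz) * delay


def pass_range_deg(theta_deg, front=10.0, side=30.0):
    """Hypothetical pass range: narrow at the front, wider at the periphery."""
    return front + (side - front) * min(abs(theta_deg), 90.0) / 90.0


def direction_pass_mask(left_spec, right_spec, freqs_hz, theta_deg):
    """Select sub-bands whose measured IPD fits the pass range around theta_deg."""
    freqs_hz = np.asarray(freqs_hz)
    # Measured IPD: phase of the left channel minus phase of the right channel.
    measured = np.angle(left_spec * np.conj(right_spec))
    delta = pass_range_deg(theta_deg)
    lo = expected_ipd(freqs_hz, theta_deg - delta)
    hi = expected_ipd(freqs_hz, theta_deg + delta)
    # Above this frequency the IPD wraps and becomes ambiguous in this model.
    unambiguous = freqs_hz < SPEED_OF_SOUND / (2.0 * MIC_DISTANCE)
    return unambiguous & (measured >= lo) & (measured <= hi)


def direction_pass_filter(left_spec, right_spec, freqs_hz, theta_deg):
    """Keep the selected sub-bands of the left channel and zero the rest."""
    mask = direction_pass_mask(left_spec, right_spec, freqs_hz, theta_deg)
    return np.where(mask, left_spec, 0.0)
```

In this sketch a spectrum frame from each microphone (for example, one column of a short-time Fourier transform) is passed in together with the sub-band center frequencies and the target direction. Because the pass range grows with the azimuth, the selection is sharpest for a source directly in front, which mirrors the "auditory fovea" property described above.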

Original language: English
Title of host publication: Proceedings of the National Conference on Artificial Intelligence
Pages: 431-438
Number of pages: 8
Publication status: Published - 2002
Externally published: Yes
Event: 18th National Conference on Artificial Intelligence (AAAI-02), 14th Innovative Applications of Artificial Intelligence Conference (IAAI-02) - Edmonton, Alta.
Duration: 28 Jul 2002 → 1 Aug 2002

Other

Other: 18th National Conference on Artificial Intelligence (AAAI-02), 14th Innovative Applications of Artificial Intelligence Conference (IAAI-02)
City: Edmonton, Alta.
Period: 28 Jul 2002 → 1 Aug 2002

ASJC Scopus subject areas

  • Software
