Computational Auditory Scene Analysis and Its Application to Robot Audition

Hiroshi G. Okuno*, Tetsuya Ogata, Kazunori Komatani, Kazuhiro Nakadai

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

15 Citations (Scopus)

Abstract

We are engaged in research on computational auditory scene analysis to attain sophisticated human-robot (or human-computer) interaction by recognizing auditory awareness. The objective of our research is to understand an arbitrary sound mixture, including non-speech sounds and music as well as voiced speech, obtained by the robot's ears (or microphones embedded in the robot). The main issues are sound source localization, separation, and recognition at the signal processing level, and signal-to-symbol transformation at the interface to the symbol processing level. The latter is critical in developmental communication, and we are developing an automatic onomatopoeia recognition system. This paper gives an overview of our activities in robot audition, in particular the active direction-pass filter (ADPF), which separates sounds originating from a specific direction by integrating sound source localization and visual processing. The ADPF is implemented on three kinds of robots and demonstrates the separation and recognition of three simultaneous utterances with a pair of microphones.

Original language: English
Title of host publication: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004
Editors: T. Ibaraki, T. Inui, K. Tanaka
Pages: 73-80
Number of pages: 8
Publication status: Published - 2004 Dec 27
Externally published: Yes
Event: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004 - Kyoto, Japan
Duration: 2004 Mar 1 to 2004 Mar 2

Publication series

Name: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004

Conference

Conference: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004
Country/Territory: Japan
City: Kyoto
Period: 04/3/1 to 04/3/2

ASJC Scopus subject areas

  • Engineering(all)
