Robot audition from the viewpoint of computational auditory scene analysis

Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

We have been engaged in research on computational auditory scene analysis to attain sophisticated robot/computer-human interaction by manipulating real-world sound signals. The objective of our research is to understand an arbitrary sound mixture, including music and environmental sounds as well as voiced speech, captured by the robot's ears (microphones embedded on the robot). The three main issues in computational auditory scene analysis are sound source localization, sound source separation, and recognition of the separated sounds, for mixtures of speech signals as well as polyphonic music signals. The Missing Feature Theory (MFT) approach integrates sound source separation and automatic speech recognition by generating missing feature masks. This robot audition system has been successfully ported to three kinds of robots: SIG2, Robovie R2, and Honda ASIMO. A robot recognizes three simultaneous utterances, for example when people place meal orders or when it acts as a referee for Rock-Paper-Scissors Sound Games, with a delay of less than 2 seconds. A real-time beat-tracking system has also been developed for robot audition. A robot hears music, understands and predicts its musical beats, and behaves in accordance with the beat times in real time.
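As a rough illustration of what a missing feature mask is, the following minimal sketch marks each time-frequency feature as reliable (1) or unreliable (0) and then scores a template using only the reliable features. The energy-ratio thresholding rule, the function names `missing_feature_mask` and `masked_distance`, and the `threshold_db` parameter are assumptions made for illustration; the abstract does not state how the authors generate or apply their masks.

```python
import math


def missing_feature_mask(separated, interference, threshold_db=0.0):
    """Build a binary mask over time-frequency features.

    Hypothetical rule (an assumption, not the paper's method): a feature is
    reliable when the separated-source energy exceeds the estimated
    interference energy by more than threshold_db.
    """
    mask = []
    for s_frame, n_frame in zip(separated, interference):
        row = []
        for s, n in zip(s_frame, n_frame):
            # Local energy ratio in dB; the small epsilon avoids log(0).
            ratio_db = 10.0 * math.log10((s + 1e-12) / (n + 1e-12))
            row.append(1 if ratio_db > threshold_db else 0)
        mask.append(row)
    return mask


def masked_distance(features, template, mask):
    """Mean squared distance computed over reliable features only."""
    total = 0.0
    count = 0
    for f_frame, t_frame, m_frame in zip(features, template, mask):
        for f, t, m in zip(f_frame, t_frame, m_frame):
            if m:
                total += (f - t) ** 2
                count += 1
    # With no reliable features there is no evidence to score.
    return total / count if count else float("inf")


# Two frames with two frequency bands each (toy energies).
separated = [[4.0, 1.0], [0.5, 9.0]]
interference = [[1.0, 2.0], [1.0, 1.0]]
mask = missing_feature_mask(separated, interference)

features = [[1.0, 5.0], [7.0, 2.0]]
template = [[1.0, 0.0], [0.0, 4.0]]
score = masked_distance(features, template, mask)
```

In this toy case the unreliable features, which differ strongly from the template, do not contribute to the score. That is the general idea behind letting a recognizer ignore features corrupted during separation.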

Original language: English
Title of host publication: Proceedings - International Conference on Informatics Education and Research for Knowledge-Circulating Society, ICKS 2008
Pages: 35-40
Number of pages: 6
DOIs: https://doi.org/10.1109/ICKS.2008.10
Publication status: Published - 2008 Aug 29
Event: International Conference on Informatics Education and Research for Knowledge-Circulating Society, ICKS 2008 - Kyoto, Japan
Duration: 2008 Jan 17 → 2008 Jan 17

Publication series

Name: Proceedings - International Conference on Informatics Education and Research for Knowledge-Circulating Society, ICKS 2008

Other

Other: International Conference on Informatics Education and Research for Knowledge-Circulating Society, ICKS 2008
Country: Japan
City: Kyoto
Period: 08/1/17 → 08/1/17

ASJC Scopus subject areas

  • Information Systems
  • Education

Cite this

Okuno, H. G., Ogata, T., & Komatani, K. (2008). Robot audition from the viewpoint of computational auditory scene analysis. In Proceedings - International Conference on Informatics Education and Research for Knowledge-Circulating Society, ICKS 2008 (pp. 35-40). [4460465] (Proceedings - International Conference on Informatics Education and Research for Knowledge-Circulating Society, ICKS 2008). https://doi.org/10.1109/ICKS.2008.10