Computational auditory scene analysis and its application to robot audition: Five years experience

Hiroshi G. Okuno, Tetsuya Ogata, Kazunori Komatani

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

17 Citations (Scopus)

Abstract

We have been engaged in research on computational auditory scene analysis to attain sophisticated robot/computer-human interaction by manipulating real-world sound signals. The objective of our research is to understand an arbitrary sound mixture, including non-speech sounds and music as well as voiced speech, obtained by the robot's ears, that is, microphones embedded in the robot. We have addressed three main issues in computational auditory scene analysis: sound source localization, separation, and recognition of separated sounds, for mixtures of speech signals as well as polyphonic music signals. This paper gives an overview of our results in robot audition, in particular the Missing Feature Theory based integration of sound source separation and automatic speech recognition, and of those in music information processing, in particular a drum sound equalizer.

Original language: English
Title of host publication: Proceedings - Second International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2007
Pages: 69-76
Number of pages: 8
DOI: https://doi.org/10.1109/ICKS.2007.7
Publication status: Published - 2007
Externally published: Yes
Event: Second International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2007 - Kyoto
Duration: 2007 Jan 29 - 2007 Jan 29

Fingerprint

Audition
Acoustic waves
Robots
Source separation
Equalizers
Human computer interaction
Microphones
Speech recognition

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Software

Cite this

Okuno, H. G., Ogata, T., & Komatani, K. (2007). Computational auditory scene analysis and its application to robot audition: Five years experience. In Proceedings - Second International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2007 (pp. 69-76). [4161215] https://doi.org/10.1109/ICKS.2007.7

@inproceedings{4ee6fe180b504c9680443747277c6ef0,
title = "Computational auditory scene analysis and its application to robot audition: Five years experience",
author = "Okuno, {Hiroshi G.} and Tetsuya Ogata and Kazunori Komatani",
year = "2007",
doi = "10.1109/ICKS.2007.7",
language = "English",
isbn = "0769528112",
pages = "69--76",
booktitle = "Proceedings - Second International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2007",

}

TY - GEN

T1 - Computational auditory scene analysis and its application to robot audition

T2 - Five years experience

AU - Okuno, Hiroshi G.

AU - Ogata, Tetsuya

AU - Komatani, Kazunori

PY - 2007

Y1 - 2007

UR - http://www.scopus.com/inward/record.url?scp=34548809160&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34548809160&partnerID=8YFLogxK

U2 - 10.1109/ICKS.2007.7

DO - 10.1109/ICKS.2007.7

M3 - Conference contribution

AN - SCOPUS:34548809160

SN - 0769528112

SN - 9780769528113

SP - 69

EP - 76

BT - Proceedings - Second International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2007

ER -