Environmental sound recognition for robot audition using matching-pursuit

Nobuhide Yamakawa, Toru Takahashi, Tetsuro Kitahara, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

Our goal is to achieve a robot audition system that is capable of recognizing multiple environmental sounds and making use of them in human-robot interaction. The main problems in environmental sound recognition in robot audition are: (1) recognition under a large amount of background noise including the noise from the robot itself, and (2) the necessity of robust feature extraction against spectrum distortion due to separation of multiple sound sources. This paper presents the environmental recognition of two sound sources fired simultaneously using matching pursuit (MP) with the Gabor wavelet, which extracts salient audio features from a signal. The two environmental sounds come from different directions, and they are localized by multiple signal classification and, using their geometric information, separated by geometric source separation with the aid of measured head-related transfer functions. The experimental results show the noise-robustness of MP although the performance depends on the properties of the sound sources.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages1-10
Number of pages10
Volume6704 LNAI
EditionPART 2
DOIs
Publication statusPublished - 2011
Externally publishedYes
Event24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011 - Syracuse, NY
Duration: 2011 Jun 282011 Jul 1

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume6704 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011
CitySyracuse, NY
Period11/6/2811/7/1

Fingerprint

Matching Pursuit
Audition
Robot
Acoustic waves
Robots
Noise Robustness
Gabor Wavelet
Source separation
Source Separation
Human-robot Interaction
Human robot interaction
Transfer Function
Feature Extraction
Transfer functions
Sound
Feature extraction
Experimental Results

Keywords

  • Computational auditory scene analysis
  • Environmental sound recognition
  • Matching pursuit
  • Robot audition

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Yamakawa, N., Takahashi, T., Kitahara, T., Ogata, T., & Okuno, H. G. (2011). Environmental sound recognition for robot audition using matching-pursuit. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (PART 2 ed., Vol. 6704 LNAI, pp. 1-10). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6704 LNAI, No. PART 2). https://doi.org/10.1007/978-3-642-21827-9_1

Environmental sound recognition for robot audition using matching-pursuit. / Yamakawa, Nobuhide; Takahashi, Toru; Kitahara, Tetsuro; Ogata, Tetsuya; Okuno, Hiroshi G.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 6704 LNAI PART 2. ed. 2011. p. 1-10 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6704 LNAI, No. PART 2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Yamakawa, N, Takahashi, T, Kitahara, T, Ogata, T & Okuno, HG 2011, Environmental sound recognition for robot audition using matching-pursuit. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). PART 2 edn, vol. 6704 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 2, vol. 6704 LNAI, pp. 1-10, 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Syracuse, NY, 11/6/28. https://doi.org/10.1007/978-3-642-21827-9_1
Yamakawa N, Takahashi T, Kitahara T, Ogata T, Okuno HG. Environmental sound recognition for robot audition using matching-pursuit. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). PART 2 ed. Vol. 6704 LNAI. 2011. p. 1-10. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2). https://doi.org/10.1007/978-3-642-21827-9_1
Yamakawa, Nobuhide ; Takahashi, Toru ; Kitahara, Tetsuro ; Ogata, Tetsuya ; Okuno, Hiroshi G. / Environmental sound recognition for robot audition using matching-pursuit. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 6704 LNAI PART 2. ed. 2011. pp. 1-10 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2).
@inproceedings{c33c9d4ea72b43c597de1a046323023c,
title = "Environmental sound recognition for robot audition using matching-pursuit",
abstract = "Our goal is to achieve a robot audition system that is capable of recognizing multiple environmental sounds and making use of them in human-robot interaction. The main problems in environmental sound recognition in robot audition are: (1) recognition under a large amount of background noise including the noise from the robot itself, and (2) the necessity of robust feature extraction against spectrum distortion due to separation of multiple sound sources. This paper presents the environmental recognition of two sound sources fired simultaneously using matching pursuit (MP) with the Gabor wavelet, which extracts salient audio features from a signal. The two environmental sounds come from different directions, and they are localized by multiple signal classification and, using their geometric information, separated by geometric source separation with the aid of measured head-related transfer functions. The experimental results show the noise-robustness of MP although the performance depends on the properties of the sound sources.",
keywords = "Computational auditory scene analysis, Environmental sound recognition, Matching pursuit, Robot audition",
author = "Nobuhide Yamakawa and Toru Takahashi and Tetsuro Kitahara and Tetsuya Ogata and Okuno, {Hiroshi G.}",
year = "2011",
doi = "10.1007/978-3-642-21827-9_1",
language = "English",
isbn = "9783642218262",
volume = "6704 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
number = "PART 2",
pages = "1--10",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
edition = "PART 2",

}

TY - GEN

T1 - Environmental sound recognition for robot audition using matching-pursuit

AU - Yamakawa, Nobuhide

AU - Takahashi, Toru

AU - Kitahara, Tetsuro

AU - Ogata, Tetsuya

AU - Okuno, Hiroshi G.

PY - 2011

Y1 - 2011

N2 - Our goal is to achieve a robot audition system that is capable of recognizing multiple environmental sounds and making use of them in human-robot interaction. The main problems in environmental sound recognition in robot audition are: (1) recognition under a large amount of background noise including the noise from the robot itself, and (2) the necessity of robust feature extraction against spectrum distortion due to separation of multiple sound sources. This paper presents the environmental recognition of two sound sources fired simultaneously using matching pursuit (MP) with the Gabor wavelet, which extracts salient audio features from a signal. The two environmental sounds come from different directions, and they are localized by multiple signal classification and, using their geometric information, separated by geometric source separation with the aid of measured head-related transfer functions. The experimental results show the noise-robustness of MP although the performance depends on the properties of the sound sources.

AB - Our goal is to achieve a robot audition system that is capable of recognizing multiple environmental sounds and making use of them in human-robot interaction. The main problems in environmental sound recognition in robot audition are: (1) recognition under a large amount of background noise including the noise from the robot itself, and (2) the necessity of robust feature extraction against spectrum distortion due to separation of multiple sound sources. This paper presents the environmental recognition of two sound sources fired simultaneously using matching pursuit (MP) with the Gabor wavelet, which extracts salient audio features from a signal. The two environmental sounds come from different directions, and they are localized by multiple signal classification and, using their geometric information, separated by geometric source separation with the aid of measured head-related transfer functions. The experimental results show the noise-robustness of MP although the performance depends on the properties of the sound sources.

KW - Computational auditory scene analysis

KW - Environmental sound recognition

KW - Matching pursuit

KW - Robot audition

UR - http://www.scopus.com/inward/record.url?scp=79960540012&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79960540012&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-21827-9_1

DO - 10.1007/978-3-642-21827-9_1

M3 - Conference contribution

SN - 9783642218262

VL - 6704 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 1

EP - 10

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -