Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers

Hyun Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

We propose a way to evaluate various sound localization systems for moving sounds under the same conditions. To construct a database for moving sounds, we developed a moving sound creation tool using the API library developed by the ARINIS Company. We developed a two-channel-based sound source localization system integrated with a cross-power spectrum phase (CSP) analysis and EM algorithm. The CSP of sound signals obtained with only two microphones is used to localize the sound source without having to use prior information such as impulse response data. The EM algorithm helps the system cope with several moving sound sources and reduce localization error. We evaluated our sound localization method using artificial moving sounds and confirmed that it can well localize moving sounds slower than 1.125 rad/sec. Finally, we solve the problem of distinguishing whether sounds are coming from the front or back by rotating a robot's head equipped with only two microphones. Our system was applied to a humanoid robot called SIG2, and we confirmed its ability to localize sounds over the entire azimuth range.

Original languageEnglish
Title of host publication2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS
Pages2197-2203
Number of pages7
DOIs
Publication statusPublished - 2008
Externally publishedYes
Event2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS - Nice
Duration: 2008 Sep 222008 Sep 26

Other

Other2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS
CityNice
Period08/9/2208/9/26

Fingerprint

Acoustic waves
Microphones
Power spectrum
Robots
Information use
Impulse response
Application programming interfaces (API)

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Cite this

Kim, H. D., Komatani, K., Ogata, T., & Okuno, H. G. (2008). Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers. In 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS (pp. 2197-2203). [4650947] https://doi.org/10.1109/IROS.2008.4650947

Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers. / Kim, Hyun Don; Komatani, Kazunori; Ogata, Tetsuya; Okuno, Hiroshi G.

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS. 2008. p. 2197-2203 4650947.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kim, HD, Komatani, K, Ogata, T & Okuno, HG 2008, Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers. in 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS., 4650947, pp. 2197-2203, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS, Nice, 08/9/22. https://doi.org/10.1109/IROS.2008.4650947
Kim HD, Komatani K, Ogata T, Okuno HG. Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers. In 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS. 2008. p. 2197-2203. 4650947 https://doi.org/10.1109/IROS.2008.4650947
Kim, Hyun Don ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G. / Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers. 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS. 2008. pp. 2197-2203
@inproceedings{05f5e078db2c4e8b88e356e285470d8d,
title = "Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers",
abstract = "We propose a way to evaluate various sound localization systems for moving sounds under the same conditions. To construct a database for moving sounds, we developed a moving sound creation tool using the API library developed by the ARINIS Company. We developed a two-channel-based sound source localization system integrated with a cross-power spectrum phase (CSP) analysis and EM algorithm. The CSP of sound signals obtained with only two microphones is used to localize the sound source without having to use prior information such as impulse response data. The EM algorithm helps the system cope with several moving sound sources and reduce localization error. We evaluated our sound localization method using artificial moving sounds and confirmed that it can well localize moving sounds slower than 1.125 rad/sec. Finally, we solve the problem of distinguishing whether sounds are coming from the front or back by rotating a robot's head equipped with only two microphones. Our system was applied to a humanoid robot called SIG2, and we confirmed its ability to localize sounds over the entire azimuth range.",
author = "Kim, {Hyun Don} and Kazunori Komatani and Tetsuya Ogata and Okuno, {Hiroshi G.}",
year = "2008",
doi = "10.1109/IROS.2008.4650947",
language = "English",
isbn = "9781424420582",
pages = "2197--2203",
booktitle = "2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS",

}

TY - GEN

T1 - Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers

AU - Kim, Hyun Don

AU - Komatani, Kazunori

AU - Ogata, Tetsuya

AU - Okuno, Hiroshi G.

PY - 2008

Y1 - 2008

N2 - We propose a way to evaluate various sound localization systems for moving sounds under the same conditions. To construct a database for moving sounds, we developed a moving sound creation tool using the API library developed by the ARINIS Company. We developed a two-channel-based sound source localization system integrated with a cross-power spectrum phase (CSP) analysis and EM algorithm. The CSP of sound signals obtained with only two microphones is used to localize the sound source without having to use prior information such as impulse response data. The EM algorithm helps the system cope with several moving sound sources and reduce localization error. We evaluated our sound localization method using artificial moving sounds and confirmed that it can well localize moving sounds slower than 1.125 rad/sec. Finally, we solve the problem of distinguishing whether sounds are coming from the front or back by rotating a robot's head equipped with only two microphones. Our system was applied to a humanoid robot called SIG2, and we confirmed its ability to localize sounds over the entire azimuth range.

AB - We propose a way to evaluate various sound localization systems for moving sounds under the same conditions. To construct a database for moving sounds, we developed a moving sound creation tool using the API library developed by the ARINIS Company. We developed a two-channel-based sound source localization system integrated with a cross-power spectrum phase (CSP) analysis and EM algorithm. The CSP of sound signals obtained with only two microphones is used to localize the sound source without having to use prior information such as impulse response data. The EM algorithm helps the system cope with several moving sound sources and reduce localization error. We evaluated our sound localization method using artificial moving sounds and confirmed that it can well localize moving sounds slower than 1.125 rad/sec. Finally, we solve the problem of distinguishing whether sounds are coming from the front or back by rotating a robot's head equipped with only two microphones. Our system was applied to a humanoid robot called SIG2, and we confirmed its ability to localize sounds over the entire azimuth range.

UR - http://www.scopus.com/inward/record.url?scp=69549103385&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=69549103385&partnerID=8YFLogxK

U2 - 10.1109/IROS.2008.4650947

DO - 10.1109/IROS.2008.4650947

M3 - Conference contribution

SN - 9781424420582

SP - 2197

EP - 2203

BT - 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS

ER -