Bayesian unification of sound source localization and separation with permutation resolution

Takuma Otsuka, Katsuhiko Ishiguro, Hiroshi Sawada, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

Sound source localization and separation with permutation resolution are essential for achieving a computational auditory scene analysis system that can extract useful information from a mixture of various sounds. Because existing methods cope separately with these problems despite their mutual dependence, the overall result with these approaches can be degraded by any failure in one of these components. This paper presents a unified Bayesian framework to solve these problems simultaneously where localization and separation are regarded as a clustering problem. Experimental results confirm that our method outperforms state-of-the-art methods in terms of the separation quality with various setups including practical reverberant environments.

Original languageEnglish
Title of host publicationProceedings of the National Conference on Artificial Intelligence
Pages2038-2045
Number of pages8
Volume3
Publication statusPublished - 2012
Externally publishedYes
Event26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12 - Toronto, ON
Duration: 2012 Jul 222012 Jul 26

Other

Other26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12
CityToronto, ON
Period12/7/2212/7/26

Fingerprint

Acoustic waves

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Cite this

Otsuka, T., Ishiguro, K., Sawada, H., & Okuno, H. G. (2012). Bayesian unification of sound source localization and separation with permutation resolution. In Proceedings of the National Conference on Artificial Intelligence (Vol. 3, pp. 2038-2045)

Bayesian unification of sound source localization and separation with permutation resolution. / Otsuka, Takuma; Ishiguro, Katsuhiko; Sawada, Hiroshi; Okuno, Hiroshi G.

Proceedings of the National Conference on Artificial Intelligence. Vol. 3 2012. p. 2038-2045.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Otsuka, T, Ishiguro, K, Sawada, H & Okuno, HG 2012, Bayesian unification of sound source localization and separation with permutation resolution. in Proceedings of the National Conference on Artificial Intelligence. vol. 3, pp. 2038-2045, 26th AAAI Conference on Artificial Intelligence and the 24th Innovative Applications of Artificial Intelligence Conference, AAAI-12 / IAAI-12, Toronto, ON, 12/7/22.
Otsuka T, Ishiguro K, Sawada H, Okuno HG. Bayesian unification of sound source localization and separation with permutation resolution. In Proceedings of the National Conference on Artificial Intelligence. Vol. 3. 2012. p. 2038-2045
Otsuka, Takuma ; Ishiguro, Katsuhiko ; Sawada, Hiroshi ; Okuno, Hiroshi G. / Bayesian unification of sound source localization and separation with permutation resolution. Proceedings of the National Conference on Artificial Intelligence. Vol. 3 2012. pp. 2038-2045
@inproceedings{23827f9c852f4b42b1a73a9cba21b36d,
title = "Bayesian unification of sound source localization and separation with permutation resolution",
abstract = "Sound source localization and separation with permutation resolution are essential for achieving a computational auditory scene analysis system that can extract useful information from a mixture of various sounds. Because existing methods cope separately with these problems despite their mutual dependence, the overall result with these approaches can be degraded by any failure in one of these components. This paper presents a unified Bayesian framework to solve these problems simultaneously where localization and separation are regarded as a clustering problem. Experimental results confirm that our method outperforms state-of-the-art methods in terms of the separation quality with various setups including practical reverberant environments.",
author = "Takuma Otsuka and Katsuhiko Ishiguro and Hiroshi Sawada and Okuno, {Hiroshi G.}",
year = "2012",
language = "English",
isbn = "9781577355687",
volume = "3",
pages = "2038--2045",
booktitle = "Proceedings of the National Conference on Artificial Intelligence",

}

TY - GEN

T1 - Bayesian unification of sound source localization and separation with permutation resolution

AU - Otsuka, Takuma

AU - Ishiguro, Katsuhiko

AU - Sawada, Hiroshi

AU - Okuno, Hiroshi G.

PY - 2012

Y1 - 2012

N2 - Sound source localization and separation with permutation resolution are essential for achieving a computational auditory scene analysis system that can extract useful information from a mixture of various sounds. Because existing methods cope separately with these problems despite their mutual dependence, the overall result with these approaches can be degraded by any failure in one of these components. This paper presents a unified Bayesian framework to solve these problems simultaneously where localization and separation are regarded as a clustering problem. Experimental results confirm that our method outperforms state-of-the-art methods in terms of the separation quality with various setups including practical reverberant environments.

AB - Sound source localization and separation with permutation resolution are essential for achieving a computational auditory scene analysis system that can extract useful information from a mixture of various sounds. Because existing methods cope separately with these problems despite their mutual dependence, the overall result with these approaches can be degraded by any failure in one of these components. This paper presents a unified Bayesian framework to solve these problems simultaneously where localization and separation are regarded as a clustering problem. Experimental results confirm that our method outperforms state-of-the-art methods in terms of the separation quality with various setups including practical reverberant environments.

UR - http://www.scopus.com/inward/record.url?scp=84868297387&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84868297387&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9781577355687

VL - 3

SP - 2038

EP - 2045

BT - Proceedings of the National Conference on Artificial Intelligence

ER -