Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes

Yoshiaki Bando, Takuma Otsuka, Katsutoshi Itoyama, Kazuyoshi Yoshii, Yoko Sasaki, Satoshi Kagami, Hiroshi G. Okuno

    研究成果: Conference contribution

    3 被引用数 (Scopus)

    抄録

    Analyzing the auditory scene of real environments is challenging partly because an unknown number and type of sound sources are observed at the same time and partly because these sounds are observed on a significantly different sound pressure level at the microphone. These are difficult problems even with state-of-the-art sound source localization and separation methods. In this paper, we exploit two such methods using a microphone array: (1) Bayesian nonparametric microphone array processing (BNP-MAP), which is capable of separating and localizing sound sources when the number of sound sources is unspecified, and (2) robot audition software 'HARK' is capable of separating and localizing in real time. Through experimentation, we found that BNP-MAP is more robust against differences in the sound pressure levels of the source signals and in the spatial closeness of source positions. Experiments analyzing real scenes of human conversations recorded in a big exhibition hall and bird calling recorded at a natural park demonstrate the efficacy and applicability of BNP-MAP.

    本文言語English
    ホスト出版物のタイトルICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    出版社Institute of Electrical and Electronics Engineers Inc.
    ページ723-727
    ページ数5
    2015-August
    ISBN(印刷版)9781467369978
    DOI
    出版ステータスPublished - 2015 8 4
    イベント40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Brisbane, Australia
    継続期間: 2014 4 192014 4 24

    Other

    Other40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
    国/地域Australia
    CityBrisbane
    Period14/4/1914/4/24

    ASJC Scopus subject areas

    • 信号処理
    • ソフトウェア
    • 電子工学および電気工学

    フィンガープリント

    「Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

    引用スタイル