Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array

Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Hiroshi G. Okuno

    研究成果: Conference contribution

    6 被引用数 (Scopus)

    抄録

    This paper presents a human-voice enhancement method for a deformable and partially-occluded microphone array. Although microphone arrays distributed on the long bodies of hose-shaped rescue robots are crucial for finding victims under collapsed buildings, human voices captured by a microphone array are contaminated by non-stationary actuator and friction noise. Standard blind source separation methods cannot be used because the relative microphone positions change over time and some of them are occasionally shaded by rubble. To solve these problems, we develop a Bayesian model that separates multichannel amplitude spectrograms into sparse and low-rank components (human voice and noise) without using phase information, which depends on the array layout. The voice level at each microphone is estimated in a time-varying manner for reducing the influence of the shaded microphones. Experiments using a 3-m hose-shaped robot with eight microphones show that our method outperforms conventional methods by the signal-to-noise ratio of 2.7 dB.

    本文言語English
    ホスト出版物のタイトル2016 24th European Signal Processing Conference, EUSIPCO 2016
    出版社European Signal Processing Conference, EUSIPCO
    ページ1018-1022
    ページ数5
    2016-November
    ISBN(電子版)9780992862657
    DOI
    出版ステータスPublished - 2016 11 28
    イベント24th European Signal Processing Conference, EUSIPCO 2016 - Budapest, Hungary
    継続期間: 2016 8 282016 9 2

    Other

    Other24th European Signal Processing Conference, EUSIPCO 2016
    国/地域Hungary
    CityBudapest
    Period16/8/2816/9/2

    ASJC Scopus subject areas

    • 信号処理
    • 電子工学および電気工学

    フィンガープリント

    「Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

    引用スタイル