Subband based blind source separation for convolutive mixtures of speech

Shoko Araki*, Shoji Makino, Robert Aichner, Tsuyoki Nishikawa, Hiroshi Saruwatari

*Corresponding author for this work

Research output: Conference article, peer-reviewed

10 citations (Scopus)

Abstract

Subband processing is applied to blind source separation (BSS) for convolutive mixtures of speech. This is motivated by the drawback of frequency-domain BSS, i.e., when a long frame with a fixed frame-shift is used to cover reverberation, the number of samples in each frequency decreases and the separation performance is degraded. In our proposed subband BSS, (1) by using a moderate number of subbands, a sufficient number of samples can be held in each subband, and (2) by using FIR filters in each subband, we can handle long reverberation. Subband BSS achieves better performance than frequency-domain BSS. Moreover, we propose efficient separation procedures that take into consideration the frequency characteristics of room reverberation and speech signals. We achieve this (3) by using longer unmixing filters in low frequency bands, and (4) by adopting overlap-blockshift in BSS's batch adaptation in low frequency bands. Consequently, frequency-dependent subband processing is successfully realized in the proposed subband BSS.
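The sample-count argument in the abstract can be illustrated with a little arithmetic. The sketch below is not taken from the paper: the signal length, the frame lengths, the half-frame shift, and the decimation factor are all hypothetical values chosen for illustration, and the helper names `frames_per_bin` and `samples_per_subband` are our own. It assumes that the frame shift is a fixed fraction (one half) of the frame length, which is one reading of "fixed frame-shift".

```python
def frames_per_bin(num_samples: int, frame_len: int, frame_shift: int) -> int:
    """Count the complete analysis frames that fit in the signal.

    In frequency-domain processing each frame contributes one value per
    frequency bin, so this is also the number of samples per bin.
    """
    if num_samples < frame_len:
        return 0
    return (num_samples - frame_len) // frame_shift + 1


def samples_per_subband(num_samples: int, decimation: int) -> int:
    """Count the samples left in each subband after decimation."""
    return num_samples // decimation


if __name__ == "__main__":
    num_samples = 24000  # hypothetical: 3 s of speech at 8 kHz

    # Assumption: frame shift is half the frame length.
    for frame_len in (512, 2048):
        n = frames_per_bin(num_samples, frame_len, frame_len // 2)
        print(f"frame length {frame_len}: {n} samples per frequency bin")

    # Assumption: a hypothetical decimation factor of 16.
    print(f"subband: {samples_per_subband(num_samples, 16)} samples per subband")
```

Under these assumed values, lengthening the frame from 512 to 2048 taps cuts the data per frequency bin from 92 to 22 samples, while a moderately decimated subband still holds 1500 samples on which FIR unmixing filters can be adapted.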

Original language: English
Pages (from-to): 509-512
Number of pages: 4
Journal: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 5
Publication status: Published - 2003
Externally published: Yes
Event: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing - Hong Kong, Hong Kong
Duration: 6 Apr 2003 to 10 Apr 2003

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering
