Frequency-domain blind source separation of many speech signals using near-field and far-field models

Ryo Mukai*, Hiroshi Sawada, Shoko Araki, Shoji Makino

*この研究の対応する著者

研究成果: Article査読

23 被引用数 (Scopus)

抄録

We discuss the frequency-domain blind source separation (BSS) of convolutive mixtures when the number of source signals is large,and the potential source locations are omnidirectional. The most critical problem related to the frequency-domain BSS is the permutation problem, and geometric information is helpful as regards solving it. In this paper, we propose a method for obtaining proper geometric information with which to solve the permutation problem when the number of source signals is large and some of the signals come from the same or a similar direction.First, we describe a method for estimating the absolute DOA by using relative DOAs obtained by the solution provided by independent component analysis (ICA) and the far-field model.Next, we propose a method for estimating the spheres on which source signals exist by using ICA solution and the near-field model. We also address another problem with regard to frequency-domain BSS that arises from the circularity of discrete-frequency representation. We discuss the characteristics of the problem and present a solution for solving it. Experimental results using eight microphones in a room show that the proposed method can separate a mixture of six speech signals arriving from various directions, even when two of them come from the same direction.£.

本文言語English
論文番号83683
ジャーナルEurasip Journal on Applied Signal Processing
2006
DOI
出版ステータスPublished - 2006
外部発表はい

ASJC Scopus subject areas

  • 信号処理
  • ハードウェアとアーキテクチャ
  • 電子工学および電気工学

フィンガープリント

「Frequency-domain blind source separation of many speech signals using near-field and far-field models」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル