A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures

Hiroshi Sawada*, Shoko Araki, Shoji Makino

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

68 Citations (Scopus)

Abstract

This paper proposes a two-stage method for the blind separation of convolutively mixed sources. We employ time-frequency masking, which can be applied even to an underdetermined case where the number of sensors is insufficient for the number of sources. In the first stage of the method, frequency bin-wise mixtures are classified based on Gaussian mixture model fitting. In the second stage, the permutation ambiguities of the bin-wise classified signals are aligned by clustering the posterior probability sequences calculated in the first stage. Experimental results for separating four speeches with three microphones under reverberant conditions show the superiority of the proposed method over existing methods based on time-difference-of-arrival estimations or signal envelope clustering.

Original languageEnglish
Title of host publication2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA
Pages139-142
Number of pages4
DOIs
Publication statusPublished - 2007
Externally publishedYes
Event2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA - New Paltz, NY, United States
Duration: 2007 Oct 212007 Oct 24

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Conference

Conference2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA
Country/TerritoryUnited States
CityNew Paltz, NY
Period07/10/2107/10/24

ASJC Scopus subject areas

  • Signal Processing

Fingerprint

Dive into the research topics of 'A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures'. Together they form a unique fingerprint.

Cite this