TY - JOUR
T1 - Stereo source separation and source counting with MAP estimation with dirichlet prior considering spatial aliasing problem
AU - Araki, Shoko
AU - Nakatani, Tomohiro
AU - Sawada, Hiroshi
AU - Makino, Shoji
PY - 2009
Y1 - 2009
N2 - In this paper, we propose a novel sparse source separation method that can estimate the number of sources and time-frequency masks simultaneously, even when the spatial aliasing problem exists. Recently, many sparse source separation approaches with time-frequency masks have been proposed. However, most of these approaches require information on the number of sources in advance. In our proposed method, we model the phase difference of arrival (PDOA) between microphones with a Gaussian mixture model (GMM) with a Dirichlet prior. Then we estimate the model parameters by using the maximum a posteriori (MAP) estimation based on the EM algorithm. In order to avoid one cluster being modeled by two or more Gaussians, we utilize a sparse distribution modeled by the Dirichlet distributions as the prior of the GMM mixture weight. Moreover, to handle wide microphone spacing cases where the spatial aliasing problem occurs, the indeterminacy of modulus 27rfc in the phase is also included in our model. Experimental results show good performance of our proposed method.
AB - In this paper, we propose a novel sparse source separation method that can estimate the number of sources and time-frequency masks simultaneously, even when the spatial aliasing problem exists. Recently, many sparse source separation approaches with time-frequency masks have been proposed. However, most of these approaches require information on the number of sources in advance. In our proposed method, we model the phase difference of arrival (PDOA) between microphones with a Gaussian mixture model (GMM) with a Dirichlet prior. Then we estimate the model parameters by using the maximum a posteriori (MAP) estimation based on the EM algorithm. In order to avoid one cluster being modeled by two or more Gaussians, we utilize a sparse distribution modeled by the Dirichlet distributions as the prior of the GMM mixture weight. Moreover, to handle wide microphone spacing cases where the spatial aliasing problem occurs, the indeterminacy of modulus 27rfc in the phase is also included in our model. Experimental results show good performance of our proposed method.
KW - Blind source separation
KW - Dirichlet distribution
KW - Number of sources
KW - Prior
KW - Sparse
KW - Spatial aliasing problem
UR - http://www.scopus.com/inward/record.url?scp=67149125347&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67149125347&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-00599-2_93
DO - 10.1007/978-3-642-00599-2_93
M3 - Conference article
AN - SCOPUS:67149125347
VL - 5441
SP - 742
EP - 750
JO - Lecture Notes in Computer Science
JF - Lecture Notes in Computer Science
SN - 0302-9743
T2 - 8th International Conference on Independent Component Analysis and Signal Separation, ICA 2009
Y2 - 15 March 2009 through 18 March 2009
ER -