TY - JOUR
T1 - Bayesian nonparametrics for microphone array processing
AU - Otsuka, Takuma
AU - Ishiguro, Katsuhiko
AU - Sawada, Hiroshi
AU - Okuno, Hiroshi G.
PY - 2014
Y1 - 2014
N2 - Sound source localization and separation from a mixture of sounds are essential functions for computational auditory scene analysis. The main challenges are designing a unified framework for joint optimization and estimating the sound sources under auditory uncertainties such as reverberation or unknown number of sounds. Since sound source localization and separation are mutually dependent, their simultaneous estimation is required for better and more robust performance. A unified model is presented for sound source localization and separation based on Bayesian nonparametrics. Experiments using simulated and recorded audio mixtures show that a method based on this model achieves state-of-the-art sound source separation quality and has more robust performance on the source number estimation under reverberant environments.
AB - Sound source localization and separation from a mixture of sounds are essential functions for computational auditory scene analysis. The main challenges are designing a unified framework for joint optimization and estimating the sound sources under auditory uncertainties such as reverberation or unknown number of sounds. Since sound source localization and separation are mutually dependent, their simultaneous estimation is required for better and more robust performance. A unified model is presented for sound source localization and separation based on Bayesian nonparametrics. Experiments using simulated and recorded audio mixtures show that a method based on this model achieves state-of-the-art sound source separation quality and has more robust performance on the source number estimation under reverberant environments.
KW - Audio source separation and enhancement (AUDSSEN)
KW - Bayesian nonparametrics
KW - Blind source separation
KW - Microphone array processing
KW - Sound source localization
KW - Spatial and multichannel audio (AUD-SMCA)
KW - Time-frequency masking
UR - http://www.scopus.com/inward/record.url?scp=84897935648&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84897935648&partnerID=8YFLogxK
U2 - 10.1109/TASLP.2013.2294582
DO - 10.1109/TASLP.2013.2294582
M3 - Article
AN - SCOPUS:84897935648
VL - 22
SP - 493
EP - 504
JO - IEEE Transactions on Speech and Audio Processing
JF - IEEE Transactions on Speech and Audio Processing
SN - 1558-7916
IS - 2
ER -