Abstract
Multichannel signal processing using a microphone array provides fundamental functions for copingwith multi-source situations, such as sound source localization and separation, that are needed to extract the auditory information for each source. Auditory uncertainties about the degree of reverberation and the number of sources are known to degrade performance or limit the practical application of microphone array processing. Such uncertainties must therefore be overcome to realize general and robust microphone array processing. These uncertainty issues have been partly addressed-existing methods focus on either source number uncertainty or the reverberation issue, where joint separation and dereverberation has been achieved only for the overdetermined conditions. This paper presents an all-round method that achieves source separation and dereverberation for an arbitrary number of sources including underdetermined conditions. Our method uses Bayesian nonparametrics that realize an infinitely extensible modeling flexibility so as to bypass the model selection in the separation and dereverberation problem, which is caused by the source number uncertainty. Evaluation using a dereverberation and separation task with various numbers of sources including underdetermined conditions demonstrates that (1) ourmethod is applicable to the separation and dereverberation of underdetermined mixtures, and that (2) the source extraction performance is comparable to that of a state-of-the-art method suitable only for overdetermined conditions.
Original language | English |
---|---|
Article number | 6926796 |
Pages (from-to) | 2218-2232 |
Number of pages | 15 |
Journal | IEEE/ACM Transactions on Speech and Language Processing |
Volume | 22 |
Issue number | 12 |
DOIs | |
Publication status | Published - 2014 Dec 1 |
Keywords
- Bayesian nonparametrics
- Blind dereverberation
- Blind source separation
- Markov chain Monte Carlo method
- Microphone array processing
- Underdetermined mixtures
ASJC Scopus subject areas
- Signal Processing
- Electrical and Electronic Engineering
- Media Technology
- Acoustics and Ultrasonics
- Instrumentation
- Linguistics and Language
- Speech and Hearing