TY - JOUR
T1 - Blind extraction of a dominant source from mixtures of many sources using ICA and time-frequency masking
AU - Sawada, Hiroshi
AU - Araki, Shoko
AU - Mukai, Ryo
AU - Makino, Shoji
PY - 2005
Y1 - 2005
N2 - This paper presents a method for enhancing a target source of interest and suppressing other interference sources. The target source is assumed to be close to sensors, to have dominant power at these sensors, and to have non-Gaussianity. The enhancement is performed blindly, i.e. without knowing the total number of sources or information about each source, such as position and active time. We consider a general case where the number of sources is larger than the number of sensors. We employ a two-stage process where independent component analysis (ICA) is first employed in each frequency bin and time-frequency masking is then used to improve the performance further. We propose a new sophisticated method for selecting the target source frequency components, and also a new criterion for specifying time-frequency masks. Experimental results for simulated cocktail party situations in a room (reverberation time was 130 ms) are presented to show the effectiveness and characteristics of the proposed method.
AB - This paper presents a method for enhancing a target source of interest and suppressing other interference sources. The target source is assumed to be close to sensors, to have dominant power at these sensors, and to have non-Gaussianity. The enhancement is performed blindly, i.e. without knowing the total number of sources or information about each source, such as position and active time. We consider a general case where the number of sources is larger than the number of sensors. We employ a two-stage process where independent component analysis (ICA) is first employed in each frequency bin and time-frequency masking is then used to improve the performance further. We propose a new sophisticated method for selecting the target source frequency components, and also a new criterion for specifying time-frequency masks. Experimental results for simulated cocktail party situations in a room (reverberation time was 130 ms) are presented to show the effectiveness and characteristics of the proposed method.
UR - http://www.scopus.com/inward/record.url?scp=33947670884&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33947670884&partnerID=8YFLogxK
U2 - 10.1109/ISCAS.2005.1465977
DO - 10.1109/ISCAS.2005.1465977
M3 - Conference article
AN - SCOPUS:33947670884
SP - 5882
EP - 5885
JO - Proceedings - IEEE International Symposium on Circuits and Systems
JF - Proceedings - IEEE International Symposium on Circuits and Systems
SN - 0271-4310
M1 - 1465977
T2 - IEEE International Symposium on Circuits and Systems 2005, ISCAS 2005
Y2 - 23 May 2005 through 26 May 2005
ER -