TY - GEN
T1 - Automatic estimation of reverberation time with robot speech to improve ICA-based robot audition
AU - Takeda, Ryu
AU - Nakadai, Kazuhiro
AU - Takahashi, Toru
AU - Komatani, Kazunori
AU - Ogata, Tetsuya
AU - Okuno, Hiroshi G.
PY - 2009
Y1 - 2009
N2 - This paper presents an ICA-based robot audition system which estimates the reverberation time of the environment automatically by using the robot's own speech. The system is based on multi-channel semi-blind independent component analysis (MCSB-ICA), a source separation method using a microphone array that can separate user and robot speech under reverberant environments. Perception of the reverberation time (RT) is critical, because an inappropriate RT degrades separation performance and increases processing time. Unlike most previous methods that assume the RT is given in advance, our method estimates an RT by using the echo's intensity of the robot's own speech. It has three steps: speaks a sentence in a new environment, calculates the relative powers of the echoes, and estimates the RT using linear regression of them. Experimental results show that this method sets an appropriate RT for MCSB-ICA for real-world environments and that word correctness is improved by up to 6 points and processing time is reduced by up to 60%.
AB - This paper presents an ICA-based robot audition system which estimates the reverberation time of the environment automatically by using the robot's own speech. The system is based on multi-channel semi-blind independent component analysis (MCSB-ICA), a source separation method using a microphone array that can separate user and robot speech under reverberant environments. Perception of the reverberation time (RT) is critical, because an inappropriate RT degrades separation performance and increases processing time. Unlike most previous methods that assume the RT is given in advance, our method estimates an RT by using the echo's intensity of the robot's own speech. It has three steps: speaks a sentence in a new environment, calculates the relative powers of the echoes, and estimates the RT using linear regression of them. Experimental results show that this method sets an appropriate RT for MCSB-ICA for real-world environments and that word correctness is improved by up to 6 points and processing time is reduced by up to 60%.
UR - http://www.scopus.com/inward/record.url?scp=77950583451&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77950583451&partnerID=8YFLogxK
U2 - 10.1109/ICHR.2009.5379572
DO - 10.1109/ICHR.2009.5379572
M3 - Conference contribution
AN - SCOPUS:77950583451
SN - 9781424445882
T3 - 9th IEEE-RAS International Conference on Humanoid Robots, HUMANOIDS09
SP - 250
EP - 255
BT - 9th IEEE-RAS International Conference on Humanoid Robots, HUMANOIDS09
T2 - 9th IEEE-RAS International Conference on Humanoid Robots, HUMANOIDS09
Y2 - 7 December 2009 through 10 December 2009
ER -