TY - JOUR
T1 - CENSREC-4
T2 - An evaluation framework for distant-talking speech recognition in reverberant environments
AU - Fukumori, Takahiro
AU - Nishiura, Takanobu
AU - Nakayama, Masato
AU - Denda, Yuki
AU - Kitaoka, Norihide
AU - Yamada, Takeshi
AU - Yamamoto, Kazumasa
AU - Tsuge, Satoru
AU - Fujimoto, Masakiyo
AU - Takiguchi, Tetsuya
AU - Miyajima, Chiyomi
AU - Tamura, Satoshi
AU - Ogawa, Tetsuji
AU - Matsuda, Shigeki
AU - Kuroiwa, Shingo
AU - Takeda, Kazuya
AU - Nakamura, Satoshi
PY - 2011
Y1 - 2011
N2 - We have been distributing a new collection of databases and evaluation tools called CENSREC-4, which is a framework for evaluating distant-talking speech in reverberant environments. The data contained in CENSREC-4 are connected digit utterances as in CENSREC-1. Two subsets are included in the data: "basic data sets" and "extra data sets." The basic data sets are used for evaluating the room impulse response-convolved speech data to simulate the various reverberations. The extra data sets consist of simulated data and corresponding real recorded data. Evaluation tools are presently only provided for the basic data sets and will be delivered to the extra data sets in the future. The task of CENSREC-4 with a basic data set appears simple; however, the results of experiments prove that CENSREC-4 provides a challenging reverberation speech-recognition task, in the sense that a traditional technique to improve recognition and a widely used criterion to represent the difficulty of recognition deliver poor performance. Within this context, this common framework can be an important step toward the future evolution of reverberant speech-recognition methodologies.
AB - We have been distributing a new collection of databases and evaluation tools called CENSREC-4, which is a framework for evaluating distant-talking speech in reverberant environments. The data contained in CENSREC-4 are connected digit utterances as in CENSREC-1. Two subsets are included in the data: "basic data sets" and "extra data sets." The basic data sets are used for evaluating the room impulse response-convolved speech data to simulate the various reverberations. The extra data sets consist of simulated data and corresponding real recorded data. Evaluation tools are presently only provided for the basic data sets and will be delivered to the extra data sets in the future. The task of CENSREC-4 with a basic data set appears simple; however, the results of experiments prove that CENSREC-4 provides a challenging reverberation speech-recognition task, in the sense that a traditional technique to improve recognition and a widely used criterion to represent the difficulty of recognition deliver poor performance. Within this context, this common framework can be an important step toward the future evolution of reverberant speech-recognition methodologies.
KW - Evaluation framework
KW - Reverberant speech database
KW - Reverberant speech recognition
KW - Room impulse response
KW - Various recording environments
UR - http://www.scopus.com/inward/record.url?scp=80052431269&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80052431269&partnerID=8YFLogxK
U2 - 10.1250/ast.32.201
DO - 10.1250/ast.32.201
M3 - Article
AN - SCOPUS:80052431269
SN - 1346-3969
VL - 32
SP - 201
EP - 210
JO - Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi)
JF - Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi)
IS - 5
ER -