CENSREC-4: An evaluation framework for distant-talking speech recognition in reverberant environments

Takahiro Fukumori, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Takeshi Yamada, Kazumasa Yamamoto, Satoru Tsuge, Masakiyo Fujimoto, Tetsuya Takiguchi, Chiyomi Miyajima, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura

    Research output: Article

    2 Citations (Scopus)

    Abstract

    We have been distributing a new collection of databases and evaluation tools called CENSREC-4, a framework for evaluating distant-talking speech recognition in reverberant environments. The data contained in CENSREC-4 are connected-digit utterances, as in CENSREC-1. The data comprise two subsets: "basic data sets" and "extra data sets." The basic data sets are used for evaluation on speech data convolved with room impulse responses to simulate various reverberant conditions. The extra data sets consist of simulated data and corresponding real recorded data. Evaluation tools are presently provided only for the basic data sets and will be provided for the extra data sets in the future. The task of CENSREC-4 with a basic data set appears simple; however, experimental results show that CENSREC-4 provides a challenging reverberant speech-recognition task, in the sense that a traditional technique to improve recognition and a widely used criterion to represent the difficulty of recognition both perform poorly. Within this context, this common framework can be an important step toward the future evolution of reverberant speech-recognition methodologies.
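The abstract describes the basic data sets as speech convolved with room impulse responses (RIRs) to simulate reverberation, i.e. y[n] = Σ_k h[k] x[n − k] for clean speech x and impulse response h. The following is a minimal sketch of that idea only, not the CENSREC-4 tooling: the function names, the toy signal, and the exponentially decaying response are all hypothetical stand-ins for recorded utterances and measured RIRs.

```python
import math

def convolve(signal, impulse_response):
    """Full linear convolution of two sequences (direct form)."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for n, s in enumerate(signal):
        for k, h in enumerate(impulse_response):
            out[n + k] += s * h
    return out

def toy_impulse_response(length, decay):
    """Hypothetical exponentially decaying response; NOT a measured RIR."""
    return [math.exp(-decay * k) for k in range(length)]

# Toy "clean" signal (assumption): a unit impulse followed by silence.
clean = [1.0, 0.0, 0.0, 0.0]
rir = toy_impulse_response(length=3, decay=1.0)

# The simulated reverberant signal is longer than the clean one:
# the decaying tail of the response smears energy past the original end.
reverberant = convolve(clean, rir)
```

In practice a measured RIR spans thousands of samples, so an FFT-based convolution would normally replace the direct double loop shown here.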

    Original language: English
    Pages (from-to): 201-210
    Number of pages: 10
    Journal: Acoustical Science and Technology
    Volume: 32
    Issue number: 5
    DOI: https://doi.org/10.1250/ast.32.201
    Publication status: Published - 2011

    ASJC Scopus subject areas

    • Acoustics and Ultrasonics

    Cite this

    Fukumori, T., Nishiura, T., Nakayama, M., Denda, Y., Kitaoka, N., Yamada, T., Yamamoto, K., Tsuge, S., Fujimoto, M., Takiguchi, T., Miyajima, C., Tamura, S., Ogawa, T., Matsuda, S., Kuroiwa, S., Takeda, K., & Nakamura, S. (2011). CENSREC-4: An evaluation framework for distant-talking speech recognition in reverberant environments. Acoustical Science and Technology, 32(5), 201-210. https://doi.org/10.1250/ast.32.201