CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments

Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura

Research output: Contribution to journalArticle

25 Citations (Scopus)

Abstract

Voice activity detection (VAD) plays an important role in speech processing including speech recognition, speech enhancement, and speech coding under noisy environments. We have developed an evaluation framework for VAD under noisy environments, named CENSREC-1-C. We designed this framework for simple isolated utterance detection and hence, this framework consists of noisy continuous digit utterances and evaluation tools for VAD results. We define two evaluation measures, one for frame-level detection performance and the other for utterance-level detection performance. We also provide the evaluation results of a power-based VAD method as a reference.

Original languageEnglish
Pages (from-to)363-371
Number of pages9
JournalAcoustical Science and Technology
Volume30
Issue number5
DOIs
Publication statusPublished - 2009 Sep 21

Keywords

  • Evaluation framework
  • Noisy speech processing
  • Voice activity detection

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

Fingerprint Dive into the research topics of 'CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments'. Together they form a unique fingerprint.

  • Cite this

    Kitaoka, N., Yamada, T., Tsuge, S., Miyajima, C., Yamamoto, K., Nishiura, T., Nakayama, M., Denda, Y., Fujimoto, M., Takiguchi, T., Tamura, S., Matsuda, S., Ogawa, T., Kuroiwa, S., Takeda, K., & Nakamura, S. (2009). CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments. Acoustical Science and Technology, 30(5), 363-371. https://doi.org/10.1250/ast.30.363