CENSREC-1-C

An evaluation framework for voice activity detection under noisy environments

Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura

    Research output: Contribution to journal > Article

    25 Citations (Scopus)

    Abstract

    Voice activity detection (VAD) plays an important role in speech processing including speech recognition, speech enhancement, and speech coding under noisy environments. We have developed an evaluation framework for VAD under noisy environments, named CENSREC-1-C. We designed this framework for simple isolated utterance detection and hence, this framework consists of noisy continuous digit utterances and evaluation tools for VAD results. We define two evaluation measures, one for frame-level detection performance and the other for utterance-level detection performance. We also provide the evaluation results of a power-based VAD method as a reference.
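
    The abstract names a power-based reference VAD and a frame-level performance measure but does not define either in this record. The following is only a minimal sketch of the general idea, not the authors' method or the CENSREC-1-C tools. The function names, the non-overlapping framing, the -20 dB threshold, and the choice of false rejection rate and false acceptance rate as frame-level measures are all assumptions made for illustration.

```python
import math


def frame_log_power(samples, frame_len):
    """Return the log power in dB of each non-overlapping frame (assumed framing)."""
    powers = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        mean_sq = sum(x * x for x in frame) / frame_len
        # Small floor avoids log10(0) on digitally silent frames.
        powers.append(10.0 * math.log10(mean_sq + 1e-12))
    return powers


def power_vad(samples, frame_len=4, threshold_db=-20.0):
    """Label a frame as speech when its log power exceeds a fixed threshold.

    The fixed threshold is a hypothetical choice, not taken from the paper.
    """
    return [p > threshold_db for p in frame_log_power(samples, frame_len)]


def frame_error_rates(reference, hypothesis):
    """Return (FRR, FAR) over frames.

    FRR: fraction of reference speech frames labelled non-speech.
    FAR: fraction of reference non-speech frames labelled speech.
    These definitions are assumed; the abstract does not state them.
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have equal length")
    speech = sum(1 for r in reference if r)
    non_speech = len(reference) - speech
    missed = sum(1 for r, h in zip(reference, hypothesis) if r and not h)
    false_alarm = sum(1 for r, h in zip(reference, hypothesis) if h and not r)
    frr = missed / speech if speech else 0.0
    far = false_alarm / non_speech if non_speech else 0.0
    return frr, far


# Toy signal: two silent frames, two loud frames, two silent frames.
signal = [0.0] * 8 + [0.5, -0.5] * 4 + [0.0] * 8
decisions = power_vad(signal, frame_len=4, threshold_db=-20.0)

# Hypothetical hand-labelled reference in which speech starts one frame earlier.
reference = [False, True, True, True, False, False]
frr, far = frame_error_rates(reference, decisions)
```

    In this toy case the detector misses one of the three reference speech frames and raises no false alarms. An utterance-level measure would instead compare detected and reference speech segments as whole units; its definition is likewise not given in this record.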

    Original language: English
    Pages (from-to): 363-371
    Number of pages: 9
    Journal: Acoustical Science and Technology
    Volume: 30
    Issue number: 5
    DOI: https://doi.org/10.1250/ast.30.363
    Publication status: Published - 2009

    Fingerprint

    evaluation
    digits
    speech recognition
    coding
    augmentation

    Keywords

    • Evaluation framework
    • Noisy speech processing
    • Voice activity detection

    ASJC Scopus subject areas

    • Acoustics and Ultrasonics

    Cite this

    Kitaoka, N., Yamada, T., Tsuge, S., Miyajima, C., Yamamoto, K., Nishiura, T., ... Nakamura, S. (2009). CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments. Acoustical Science and Technology, 30(5), 363-371. https://doi.org/10.1250/ast.30.363

    @article{b331f6b516a5462c8ba4f2c9f0fe575b,
    title = "CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments",
    abstract = "Voice activity detection (VAD) plays an important role in speech processing including speech recognition, speech enhancement, and speech coding under noisy environments. We have developed an evaluation framework for VAD under noisy environments, named CENSREC-1-C. We designed this framework for simple isolated utterance detection and hence, this framework consists of noisy continuous digit utterances and evaluation tools for VAD results. We define two evaluation measures, one for frame-level detection performance and the other for utterance-level detection performance. We also provide the evaluation results of a power-based VAD method as a reference.",
    keywords = "Evaluation framework, Noisy speech processing, Voice activity detection",
    author = "Norihide Kitaoka and Takeshi Yamada and Satoru Tsuge and Chiyomi Miyajima and Kazumasa Yamamoto and Takanobu Nishiura and Masato Nakayama and Yuki Denda and Masakiyo Fujimoto and Tetsuya Takiguchi and Satoshi Tamura and Shigeki Matsuda and Tetsuji Ogawa and Shingo Kuroiwa and Kazuya Takeda and Satoshi Nakamura",
    year = "2009",
    doi = "10.1250/ast.30.363",
    language = "English",
    volume = "30",
    pages = "363--371",
    journal = "Acoustical Science and Technology",
    issn = "1346-3969",
    publisher = "Acoustical Society of Japan",
    number = "5",

    }

    TY - JOUR

    T1 - CENSREC-1-C

    T2 - An evaluation framework for voice activity detection under noisy environments

    AU - Kitaoka, Norihide

    AU - Yamada, Takeshi

    AU - Tsuge, Satoru

    AU - Miyajima, Chiyomi

    AU - Yamamoto, Kazumasa

    AU - Nishiura, Takanobu

    AU - Nakayama, Masato

    AU - Denda, Yuki

    AU - Fujimoto, Masakiyo

    AU - Takiguchi, Tetsuya

    AU - Tamura, Satoshi

    AU - Matsuda, Shigeki

    AU - Ogawa, Tetsuji

    AU - Kuroiwa, Shingo

    AU - Takeda, Kazuya

    AU - Nakamura, Satoshi

    PY - 2009

    Y1 - 2009

    N2 - Voice activity detection (VAD) plays an important role in speech processing including speech recognition, speech enhancement, and speech coding under noisy environments. We have developed an evaluation framework for VAD under noisy environments, named CENSREC-1-C. We designed this framework for simple isolated utterance detection and hence, this framework consists of noisy continuous digit utterances and evaluation tools for VAD results. We define two evaluation measures, one for frame-level detection performance and the other for utterance-level detection performance. We also provide the evaluation results of a power-based VAD method as a reference.

    KW - Evaluation framework

    KW - Noisy speech processing

    KW - Voice activity detection

    UR - http://www.scopus.com/inward/record.url?scp=70349094936&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=70349094936&partnerID=8YFLogxK

    U2 - 10.1250/ast.30.363

    DO - 10.1250/ast.30.363

    M3 - Article

    VL - 30

    SP - 363

    EP - 371

    JO - Acoustical Science and Technology

    JF - Acoustical Science and Technology

    SN - 1346-3969

    IS - 5

    ER -