Free software toolkit for Japanese large vocabulary continuous speech recognition

Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    81 Citations (Scopus)

    Abstract

    A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a governmental support. The repository consists of a recognition engine (Julius), Japanese acoustic models and statistical language models as well as Japanese morphological analysis tools. These modules can be easily integrated and replaced under a plug-and-play framework, which makes it possible to fairly evaluate components and to develop specific application systems. Assessment of these modules and systems in a 20000-word dictation task is reported. The software repository is freely available to the public.

    Original languageEnglish
    Title of host publication6th International Conference on Spoken Language Processing, ICSLP 2000
    PublisherInternational Speech Communication Association
    ISBN (Electronic)7801501144, 9787801501141
    Publication statusPublished - 2000
    Event6th International Conference on Spoken Language Processing, ICSLP 2000 - Beijing, China
    Duration: 2000 Oct 162000 Oct 20

    Other

    Other6th International Conference on Spoken Language Processing, ICSLP 2000
    CountryChina
    CityBeijing
    Period00/10/1600/10/20

    Fingerprint

    vocabulary
    acoustics
    language
    software
    Speech Recognition
    Software
    Vocabulary
    Repository
    Continuous Speech
    Toolkit
    Module
    Morphological Analysis
    Acoustics
    Dictation
    Language Model

    ASJC Scopus subject areas

    • Linguistics and Language
    • Language and Linguistics

    Cite this

    Kawahara, T., Lee, A., Kobayashi, T., Takeda, K., Minematsu, N., Sagayama, S., ... Shikano, K. (2000). Free software toolkit for Japanese large vocabulary continuous speech recognition. In 6th International Conference on Spoken Language Processing, ICSLP 2000 International Speech Communication Association.

    Free software toolkit for Japanese large vocabulary continuous speech recognition. / Kawahara, Tatsuya; Lee, Akinobu; Kobayashi, Tetsunori; Takeda, Kazuya; Minematsu, Nobuaki; Sagayama, Shigeki; Itou, Katsunobu; Ito, Akinori; Yamamoto, Mikio; Yamada, Atsushi; Utsuro, Takehito; Shikano, Kiyohiro.

    6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association, 2000.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Kawahara, T, Lee, A, Kobayashi, T, Takeda, K, Minematsu, N, Sagayama, S, Itou, K, Ito, A, Yamamoto, M, Yamada, A, Utsuro, T & Shikano, K 2000, Free software toolkit for Japanese large vocabulary continuous speech recognition. in 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association, 6th International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, 00/10/16.
    Kawahara T, Lee A, Kobayashi T, Takeda K, Minematsu N, Sagayama S et al. Free software toolkit for Japanese large vocabulary continuous speech recognition. In 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association. 2000
    Kawahara, Tatsuya ; Lee, Akinobu ; Kobayashi, Tetsunori ; Takeda, Kazuya ; Minematsu, Nobuaki ; Sagayama, Shigeki ; Itou, Katsunobu ; Ito, Akinori ; Yamamoto, Mikio ; Yamada, Atsushi ; Utsuro, Takehito ; Shikano, Kiyohiro. / Free software toolkit for Japanese large vocabulary continuous speech recognition. 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association, 2000.
    @inproceedings{e44bb01aeefc4c1db4a0c6c57761d747,
    title = "Free software toolkit for Japanese large vocabulary continuous speech recognition",
    abstract = "A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a governmental support. The repository consists of a recognition engine (Julius), Japanese acoustic models and statistical language models as well as Japanese morphological analysis tools. These modules can be easily integrated and replaced under a plug-and-play framework, which makes it possible to fairly evaluate components and to develop specific application systems. Assessment of these modules and systems in a 20000-word dictation task is reported. The software repository is freely available to the public.",
    author = "Tatsuya Kawahara and Akinobu Lee and Tetsunori Kobayashi and Kazuya Takeda and Nobuaki Minematsu and Shigeki Sagayama and Katsunobu Itou and Akinori Ito and Mikio Yamamoto and Atsushi Yamada and Takehito Utsuro and Kiyohiro Shikano",
    year = "2000",
    language = "English",
    booktitle = "6th International Conference on Spoken Language Processing, ICSLP 2000",
    publisher = "International Speech Communication Association",

    }

    TY - GEN

    T1 - Free software toolkit for Japanese large vocabulary continuous speech recognition

    AU - Kawahara, Tatsuya

    AU - Lee, Akinobu

    AU - Kobayashi, Tetsunori

    AU - Takeda, Kazuya

    AU - Minematsu, Nobuaki

    AU - Sagayama, Shigeki

    AU - Itou, Katsunobu

    AU - Ito, Akinori

    AU - Yamamoto, Mikio

    AU - Yamada, Atsushi

    AU - Utsuro, Takehito

    AU - Shikano, Kiyohiro

    PY - 2000

    Y1 - 2000

    N2 - A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a governmental support. The repository consists of a recognition engine (Julius), Japanese acoustic models and statistical language models as well as Japanese morphological analysis tools. These modules can be easily integrated and replaced under a plug-and-play framework, which makes it possible to fairly evaluate components and to develop specific application systems. Assessment of these modules and systems in a 20000-word dictation task is reported. The software repository is freely available to the public.

    AB - A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a governmental support. The repository consists of a recognition engine (Julius), Japanese acoustic models and statistical language models as well as Japanese morphological analysis tools. These modules can be easily integrated and replaced under a plug-and-play framework, which makes it possible to fairly evaluate components and to develop specific application systems. Assessment of these modules and systems in a 20000-word dictation task is reported. The software repository is freely available to the public.

    UR - http://www.scopus.com/inward/record.url?scp=85009144958&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85009144958&partnerID=8YFLogxK

    M3 - Conference contribution

    BT - 6th International Conference on Spoken Language Processing, ICSLP 2000

    PB - International Speech Communication Association

    ER -