Japanese Dictation Toolkit -1997 version

Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano

Research output: Contribution to journalArticle

27 Citations (Scopus)

Abstract

The Japanese Dictation Toolkit has been designed and developed as a baseline platform for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition). The platform consists of a standard recognition engine, Japanese phone models and Japanese statistical language models. We set up a variety of Japanese phone HMMs from a context-independent monophone to a triphone model of thousands of states. They are trained with ASJ (The Acoustical Society of Japan) databases. A lexicon and word N-gram (2-gram and 3-gram) models are constructed with a corpus of Mainichi newspaper. The recognition engine JULIUS is developed for evaluation of both acoustic and language models. As an integrated system of these modules, we have implemented a baseline 5,000-word dictation system and evaluated various components. The software repository is available to the public.

Original languageEnglish
Pages (from-to)233-239
Number of pages7
JournalJournal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi)
Volume20
Issue number3
DOIs
Publication statusPublished - 1999 Jan 1

Keywords

  • Large vocabulary continuous speech recognition
  • Software

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

Fingerprint Dive into the research topics of 'Japanese Dictation Toolkit -1997 version'. Together they form a unique fingerprint.

  • Cite this

    Kawahara, T., Lee, A., Kobayashi, T., Takeda, K., Minematsu, N., Itou, K., Ito, A., Yamamoto, M., Yamada, A., Utsuro, T., & Shikano, K. (1999). Japanese Dictation Toolkit -1997 version. Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 20(3), 233-239. https://doi.org/10.1250/ast.20.233