Japanese Dictation Toolkit -1997 version

Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano

Research output: Article

27 citations (Scopus)

Abstract

The Japanese Dictation Toolkit has been designed and developed as a baseline platform for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition). The platform consists of a standard recognition engine, Japanese phone models and Japanese statistical language models. We set up a variety of Japanese phone HMMs from a context-independent monophone to a triphone model of thousands of states. They are trained with ASJ (The Acoustical Society of Japan) databases. A lexicon and word N-gram (2-gram and 3-gram) models are constructed with a corpus of Mainichi newspaper. The recognition engine JULIUS is developed for evaluation of both acoustic and language models. As an integrated system of these modules, we have implemented a baseline 5,000-word dictation system and evaluated various components. The software repository is available to the public.

Original language: English
Pages (from-to): 233-239
Number of pages: 7
Journal: Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi)
Volume: 20
Issue number: 3
DOI: https://doi.org/10.1250/ast.20.233
Publication status: Published - 1 Jan 1999

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

  • Cite this

    Kawahara, T., Lee, A., Kobayashi, T., Takeda, K., Minematsu, N., Itou, K., Ito, A., Yamamoto, M., Yamada, A., Utsuro, T., & Shikano, K. (1999). Japanese Dictation Toolkit -1997 version. Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 20(3), 233-239. https://doi.org/10.1250/ast.20.233