Out-of-vocabulary word recognition using a hierarchical language model based on multiple Markov models

Hirofumi Yamamoto*, Hiroaki Kokubo, Genichiro Kikui, Yoshihiko Ogawa, Yoshinori Sagisaka

*Corresponding author for this work

Research output: Article, peer-reviewed

1 citation (Scopus)

Abstract

In this paper we propose a language model that addresses the problem of task-dependent out-of-vocabulary words in speech recognition. Language model adaptation is the standard way to apply a language model to a new task; however, it cannot handle out-of-vocabulary proper names that appear in a task-dependent fashion. We address this problem with a hierarchical language model that uses two independent Markov models to constrain the transition probabilities and the phonetic sequence emission probabilities of out-of-vocabulary words. The emission probability of an out-of-vocabulary word is thus expressed as a double Markov model that combines both sets of probabilities. We conducted speech recognition experiments on Japanese dialogue data in the appointments domain. For sentences containing one or more out-of-vocabulary words, the approach gives a word accuracy of 86.7%, compared with 78.2% when no strategy for out-of-vocabulary words is employed. This corresponds to an elimination of 34.4% of the baseline errors and confirms the effectiveness of the approach.
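The two-level idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes one plausible reading in which a word-level Markov model predicts a class token for out-of-vocabulary words and a separate phoneme-level Markov model scores the phonetic sequence inside that class. The bigram order, the add-one smoothing, the `<OOV>` token, the helper names (`BigramModel`, `hierarchical_logprob`), and the training data are all hypothetical choices made for illustration.

```python
import math
from collections import defaultdict

OOV = "<OOV>"  # hypothetical class token standing in for any out-of-vocabulary word


class BigramModel:
    """Add-one smoothed bigram (first-order Markov) model over arbitrary symbols."""

    def __init__(self, sequences):
        self.bigram = defaultdict(lambda: defaultdict(int))
        self.context = defaultdict(int)
        self.vocab = {"</s>"}
        for seq in sequences:
            padded = ["<s>"] + list(seq) + ["</s>"]
            self.vocab.update(padded[1:])
            for prev, cur in zip(padded, padded[1:]):
                self.bigram[prev][cur] += 1
                self.context[prev] += 1

    def prob(self, prev, cur):
        # Add-one smoothing keeps unseen transitions at a small nonzero probability.
        return (self.bigram[prev][cur] + 1) / (self.context[prev] + len(self.vocab))

    def sequence_logprob(self, seq):
        padded = ["<s>"] + list(seq) + ["</s>"]
        return sum(math.log(self.prob(p, c)) for p, c in zip(padded, padded[1:]))


def hierarchical_logprob(word_model, phone_model, prev_word, phones):
    """log P(<OOV> | prev_word) + log P(phones | <OOV>).

    The first term comes from the word-level model, the second from the
    phoneme-level model; the two models are trained independently.
    """
    return math.log(word_model.prob(prev_word, OOV)) + phone_model.sequence_logprob(phones)


# Toy training data (invented for this sketch, not taken from the paper).
word_sentences = [
    ["i", "am", OOV],
    ["this", "is", OOV],
    ["call", OOV, "please"],
]
name_phones = [
    ["t", "a", "n", "a", "k", "a"],
    ["y", "a", "m", "a", "d", "a"],
    ["s", "a", "t", "o"],
]

word_lm = BigramModel(word_sentences)
phone_lm = BigramModel(name_phones)

name_like = ["t", "a", "n", "a", "k", "a"]
scrambled = ["k", "k", "t", "t", "n", "n"]

print(hierarchical_logprob(word_lm, phone_lm, "am", name_like))
print(hierarchical_logprob(word_lm, phone_lm, "am", scrambled))
print(hierarchical_logprob(word_lm, phone_lm, "please", name_like))
```

In this sketch an unknown name scores well only when it appears in a context where the word-level model expects an out-of-vocabulary word and its phoneme sequence resembles the names seen by the phoneme-level model, which is the kind of joint constraint the abstract describes.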

Original language: English
Pages (from-to): 55-64
Number of pages: 10
Journal: Electronics and Communications in Japan, Part II: Electronics (English translation of Denshi Tsushin Gakkai Ronbunshi)
Volume: 88
Issue number: 12
DOI
Publication status: Published - Dec 1, 2005

ASJC Scopus subject areas

  • General Physics and Astronomy
  • Computer Networks and Communications
  • Electrical and Electronic Engineering
