CLASS-COMBINED ORD N-GRAM FOR ROBUST LANGUAGE ODELING

Norihiko Kobayashi, Tetsunori Kobayashi

研究成果: Paper査読

1 被引用数 (Scopus)

抄録

We propose a method of robust language model ing for a small amount of training text corpus. In this method, the word bigram and the class bigram are combined using a weighting function of preceding word frequency. We made experiments on speech recogni tion using JNAS speech corpus. As the results, it was proved that the performance of the class combined bi gram is equivalent to that of the word bigram trained with 2.5 larger size of corpus. We also made experi ments using sports news dialogue on TV. Recognition accuracy of the class-combined bigram was 83.3% that was 5.5 point higher than that of the word bigram.

本文言語English
ページ1599-1602
ページ数4
出版ステータスPublished - 1999
イベント6th European Conference on Speech Communication and Technology, EUROSPEECH 1999 - Budapest, Hungary
継続期間: 1999 9月 51999 9月 9

Conference

Conference6th European Conference on Speech Communication and Technology, EUROSPEECH 1999
国/地域Hungary
CityBudapest
Period99/9/599/9/9

ASJC Scopus subject areas

  • コンピュータ サイエンスの応用
  • ソフトウェア
  • 言語学および言語
  • 通信

フィンガープリント

「CLASS-COMBINED ORD N-GRAM FOR ROBUST LANGUAGE ODELING」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル