Automatic generation of multiple pronunciations based on neural networks

Toshiaki Fukada, Takayoshi Yoshimura, Yoshinori Sagisaka

研究成果: Article査読

27 被引用数 (Scopus)

抄録

We propose a method for automatically generating a pronunciation dictionary based on a pronunciation neural network that can predict plausible pronunciations (realized pronunciations) from the canonical pronunciation. This method can generate multiple forms of realized pronunciations using the pronunciation network. For generating a sophisticated realized pronunciation dictionary, two techniques are described: (1) realized pronunciations with likelihoods and (2) realized pronunciations for word boundary phonemes. Experimental results on spontaneous speech show that the automatically derived pronunciation dictionaries give consistently higher recognition rates than a conventional dictionary.

本文言語English
ページ(範囲)63-73
ページ数11
ジャーナルSpeech Communication
27
1
DOI
出版ステータスPublished - 1999
外部発表はい

ASJC Scopus subject areas

  • Software
  • Modelling and Simulation
  • Communication
  • Language and Linguistics
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

フィンガープリント 「Automatic generation of multiple pronunciations based on neural networks」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル