Reliable utterance segment recognition by integrating a grammar with statistical language constraints

Hajime Tsukada*, Hirofumi Yamamoto, Toshiyuki Takezawa, Yoshinori Sagisaka

*この研究の対応する著者

研究成果: Article査読

1 被引用数 (Scopus)

抄録

This paper proposes a novel approach to the recognition of complete utterances and partial segments of utterances. This approach ensures a high level of confidence in the results. The proposed method is based on the cooperative use of a conventional n-gram constraint and additional grammatical constraints which take deviations from the grammar into account with a multi-pass search strategy. The partial utterance segments are obtained with high confidence as the segments that satisfy both n-gram and grammatical constraints. For improved efficiency, the context-free grammar expressing the grammatical constraints is approximated by a finite-state automaton. We consider all kinds of deviations from the grammar such as insertions, deletions and substitutions when applying the grammatical constraints. As a result, we can achieve a more robust application of grammatical constraints compared to a conventional word-skipping robust parser that can only handle one type of deviation, that is, insertions. Our experiments confirm that the proposed method can recognize partial segments of utterances more reliably than conventional continuous speech recognition methods using only n-grams. In addition, our results indicate that allowing more deviations from the grammatical constraints leads to better performance than the conventional word-skipping robust parser approach.

本文言語English
ページ(範囲)299-309
ページ数11
ジャーナルSpeech Communication
26
4
DOI
出版ステータスPublished - 1998 12
外部発表はい

ASJC Scopus subject areas

  • ソフトウェア
  • モデリングとシミュレーション
  • 通信
  • 言語および言語学
  • 言語学および言語
  • コンピュータ ビジョンおよびパターン認識
  • コンピュータ サイエンスの応用

フィンガープリント

「Reliable utterance segment recognition by integrating a grammar with statistical language constraints」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル