Design and creation of speech and text corpora of dialogue

Satoru Hayamizu*, Shuichi Itahashi, Tetsunori Kobayashi, Toshiyuki Takezawa

*この研究の対応する著者

研究成果: Article査読

8 被引用数 (Scopus)

抄録

This paper describes issues on dialogue corpora for speech and natural language research. Speech and text corpora of dialogue have recently become more important for the development and the evaluation of speech and text-based dialogue systems. However, the design and the construction of dialogue corpora themselves still remain research issues and many problems have not yet been clarified. Many kinds of corpus are necessary to study various aspects of dialogues. On the other hand, each corpus should contain a certain quantity for each purpose in order to make it statistically meaningful. This paper presents the issues related with design and creation of dialogue corpora; the selection of a task domain, transcription conventions, situations for the collection, syntactic and semantic ill-formedness, and politeness. Future directions for dialogue corpora creation are also discussed.

本文言語English
ページ(範囲)17-22
ページ数6
ジャーナルIEICE Transactions on Information and Systems
E76-D
1
出版ステータスPublished - 1993 1 1
外部発表はい

ASJC Scopus subject areas

  • ソフトウェア
  • ハードウェアとアーキテクチャ
  • コンピュータ ビジョンおよびパターン認識
  • 電子工学および電気工学
  • 人工知能

フィンガープリント

「Design and creation of speech and text corpora of dialogue」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル