Rapid development of a corpus with discourse annotations using two-stage crowdsourcing

Daisuke Kawahara, Yuichiro Machida, Tomohide Shibata, Sadao Kurohashi, Hayato Kobayashi, Manabu Sassano

研究成果: Conference contribution

14 被引用数 (Scopus)

抄録

We present a novel approach for rapidly developing a corpus with discourse annotations using crowdsourcing. Although discourse annotations typically require much time and cost owing to their complex nature, we realize discourse annotations in an extremely short time while retaining good quality of the annotations by crowdsourcing two annotation subtasks. In fact, our experiment to create a corpus comprising 30,000 Japanese sentences took less than eight hours to run. Based on this corpus, we also develop a supervised discourse parser and evaluate its performance to verify the usefulness of the acquired corpus.

本文言語English
ホスト出版物のタイトルCOLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014
ホスト出版物のサブタイトルTechnical Papers
出版社Association for Computational Linguistics, ACL Anthology
ページ269-278
ページ数10
ISBN(電子版)9781941643266
出版ステータスPublished - 2014
外部発表はい
イベント25th International Conference on Computational Linguistics, COLING 2014 - Dublin, Ireland
継続期間: 2014 8月 232014 8月 29

出版物シリーズ

名前COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers

Conference

Conference25th International Conference on Computational Linguistics, COLING 2014
国/地域Ireland
CityDublin
Period14/8/2314/8/29

ASJC Scopus subject areas

  • 言語および言語学
  • 言語学および言語

フィンガープリント

「Rapid development of a corpus with discourse annotations using two-stage crowdsourcing」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル