Consistent word segmentation, part-of-speech tagging and dependency labelling annotation for Chinese language

Mo Shen, Wingmui Li, Hyunjeong Choe, Chenhui Chu, Daisuke Kawahara, Sadao Kurohashi

研究成果

4 被引用数 (Scopus)

抄録

In this paper, we propose a new annotation approach to Chinese word segmentation, part-of-speech (POS) tagging and dependency labelling that aims to overcome the two major issues in traditional morphology-based annotation: Inconsistency and data sparsity. We re-annotate the Penn Chinese Treebank 5.0 (CTB5) and demonstrate the advantages of this approach compared to the original CTB5 annotation through word segmentation, POS tagging and machine translation experiments.

本文言語English
ホスト出版物のタイトルCOLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016
ホスト出版物のサブタイトルTechnical Papers
出版社Association for Computational Linguistics, ACL Anthology
ページ298-308
ページ数11
ISBN(印刷版)9784879747020
出版ステータスPublished - 2016
外部発表はい
イベント26th International Conference on Computational Linguistics, COLING 2016 - Osaka, Japan
継続期間: 2016 12 112016 12 16

出版物シリーズ

名前COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers

Other

Other26th International Conference on Computational Linguistics, COLING 2016
国/地域Japan
CityOsaka
Period16/12/1116/12/16

ASJC Scopus subject areas

  • 計算理論と計算数学
  • 言語および言語学
  • 言語学および言語

フィンガープリント

「Consistent word segmentation, part-of-speech tagging and dependency labelling annotation for Chinese language」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル