Organising lexica into analogical grids: a study of a holistic approach for morphological generation under various sizes of data in various languages

Rashel Fam*, Yves Lepage

*Corresponding author for this work

Research output: Article, peer-reviewed

Abstract

Morphological generation is the task of producing a target word form given a lemma and a morphosyntactic description of that form. Because the syntactic and semantic relations of a word to other forms are reflected in the word form itself, we show how to exploit these relations between word forms holistically, that is, as a whole, to derive the target form without breaking the words into morphemes. Experimental results show that organising the lexica into analogical grids improves the accuracy of morphological generation by up to 8% in low-data scenarios. Our holistic approach always outperforms a morpheme-based baseline. We also investigate possible improvements from data augmentation for neural approaches, especially in low-data scenarios. However, beyond a certain point, our system does not seem to gain any advantage from having more data.
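To make the idea of deriving a form from whole-word relations concrete, here is a minimal sketch that solves a proportional analogy a : b :: c : x by suffix substitution and applies it row by row over a small grid. It is an illustration only, not the method described in the article: the suffix-only solver is a deliberate simplification, and the function names, the grid layout (a list of dictionaries), the "LEMMA" key, and the tags such as "V;PST" are all assumptions made for this example.

```python
from typing import Dict, List, Optional


def common_prefix_len(a: str, b: str) -> int:
    """Return the length of the longest common prefix of a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def solve_suffix_analogy(a: str, b: str, c: str) -> Optional[str]:
    """Solve a : b :: c : x, assuming a and b differ only in their suffix.

    Returns None when c does not end with the suffix that a drops.
    This is a simplification of formal analogy, not a general solver.
    """
    k = common_prefix_len(a, b)
    a_suffix, b_suffix = a[k:], b[k:]
    if not c.endswith(a_suffix):
        return None
    stem = c[: len(c) - len(a_suffix)]
    return stem + b_suffix


def generate(lemma: str, target_tag: str,
             grid: List[Dict[str, str]]) -> Optional[str]:
    """Generate a form for `lemma` using whole-word analogies from `grid`.

    `grid` is a hypothetical layout: each row maps a tag to a word form,
    with the lemma stored under the key "LEMMA".
    """
    for row in grid:
        if "LEMMA" in row and target_tag in row:
            candidate = solve_suffix_analogy(row["LEMMA"], row[target_tag], lemma)
            if candidate is not None:
                return candidate
    return None


# A toy grid with illustrative tags (assumed, not taken from the article).
toy_grid = [
    {"LEMMA": "make", "V;PRS;PTCP": "making"},
    {"LEMMA": "walk", "V;PST": "walked", "V;PRS;PTCP": "walking"},
]
```

With this toy grid, `generate("take", "V;PRS;PTCP", toy_grid)` uses the first row (make : making :: take : x), while `generate("talk", "V;PST", toy_grid)` falls through to the second row. No word is segmented into morphemes in advance; the change is read off a pair of whole forms each time.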

Original language: English
Journal: Journal of Experimental and Theoretical Artificial Intelligence
Publication status: Accepted/In press - 2022

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Artificial Intelligence
