Morphological predictability of unseen words using computational analogy

Rashel Fam, Yves Lepage

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

We address the problem of predicting unseen words by relying on the organization of the vocabulary of a language as exhibited by paradigm tables. We present a pipeline to automatically produce paradigm tables from all the words contained in a text. We measure how many unseen words from an unseen test text can be predicted using the paradigm tables obtained from a training text. Experiments are carried out in several languages to compare the morphological richness of languages, and also the richness of the vocabulary of different authors.

Original languageEnglish
Pages (from-to)51-60
Number of pages10
JournalCEUR Workshop Proceedings
Volume1815
Publication statusPublished - 2016

    Fingerprint

Keywords

  • Paradigm tables
  • Unseen words
  • Word predictability

ASJC Scopus subject areas

  • Computer Science(all)

Cite this