Morphological predictability of unseen words using computational analogy

Rashel Fam, Yves Lepage

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

We address the problem of predicting unseen words by relying on the organization of the vocabulary of a language as exhibited by paradigm tables. We present a pipeline to automatically produce paradigm tables from all the words contained in a text. We measure how many unseen words from an unseen test text can be predicted using the paradigm tables obtained from a training text. Experiments are carried out in several languages to compare the morphological richness of languages, and also the richness of the vocabulary of different authors.

Original languageEnglish
Pages (from-to)51-60
Number of pages10
JournalCEUR Workshop Proceedings
Volume1815
Publication statusPublished - 2016

Fingerprint

Pipelines
Experiments

Keywords

  • Paradigm tables
  • Unseen words
  • Word predictability

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Morphological predictability of unseen words using computational analogy. / Fam, Rashel; Lepage, Yves.

In: CEUR Workshop Proceedings, Vol. 1815, 2016, p. 51-60.

Research output: Contribution to journalArticle

@article{4c8b21660e5742448c7fc3f6aabe891a,
title = "Morphological predictability of unseen words using computational analogy",
abstract = "We address the problem of predicting unseen words by relying on the organization of the vocabulary of a language as exhibited by paradigm tables. We present a pipeline to automatically produce paradigm tables from all the words contained in a text. We measure how many unseen words from an unseen test text can be predicted using the paradigm tables obtained from a training text. Experiments are carried out in several languages to compare the morphological richness of languages, and also the richness of the vocabulary of different authors.",
keywords = "Paradigm tables, Unseen words, Word predictability",
author = "Rashel Fam and Yves Lepage",
year = "2016",
language = "English",
volume = "1815",
pages = "51--60",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",

}

TY - JOUR

T1 - Morphological predictability of unseen words using computational analogy

AU - Fam, Rashel

AU - Lepage, Yves

PY - 2016

Y1 - 2016

N2 - We address the problem of predicting unseen words by relying on the organization of the vocabulary of a language as exhibited by paradigm tables. We present a pipeline to automatically produce paradigm tables from all the words contained in a text. We measure how many unseen words from an unseen test text can be predicted using the paradigm tables obtained from a training text. Experiments are carried out in several languages to compare the morphological richness of languages, and also the richness of the vocabulary of different authors.

AB - We address the problem of predicting unseen words by relying on the organization of the vocabulary of a language as exhibited by paradigm tables. We present a pipeline to automatically produce paradigm tables from all the words contained in a text. We measure how many unseen words from an unseen test text can be predicted using the paradigm tables obtained from a training text. Experiments are carried out in several languages to compare the morphological richness of languages, and also the richness of the vocabulary of different authors.

KW - Paradigm tables

KW - Unseen words

KW - Word predictability

UR - http://www.scopus.com/inward/record.url?scp=85017371499&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85017371499&partnerID=8YFLogxK

M3 - Article

VL - 1815

SP - 51

EP - 60

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

ER -