A constraint approach to lexicon induction for low-resource languages

Mairidan Wushouer, Donghui Lin, Toru Ishida, Yohei Murakami

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Bilingual lexicon is a useful language resource, but such data rarely available for lower-density language pairs, especially for those that are closely related. The lack or absence of parallel and comparable corpora makes bilingual lexicon extraction becomes a difficult task. Using a third language to link two other languages is a well-known solution in low-resource situation, which usually requires only two input bilingual lexicons to automatically induce the new one. This approach, however, is weak in measuring semantic distance between bilingual word pairs because it has never been demonstrated to utilize the complete structures of the input bilingual lexicons as dropped meanings negatively influence the result. This research discuss a constraint approach to pivot-based lexicon induction in case the target language pair are closely related. We create constraints from language similarity and model the structures of the input dictionaries as an optimization problem whose solution produces optimally correct target bilingual lexicon. In addition, we enable created bilingual lexicons of low-resource languages accessible through service grid federation.

Original languageEnglish
Title of host publicationCognitive Technologies
PublisherSpringer-Verlag
Pages109-123
Number of pages15
Edition9789811077920
DOIs
Publication statusPublished - 2018 Jan 1
Externally publishedYes

Publication series

NameCognitive Technologies
Number9789811077920
ISSN (Print)1611-2482

Keywords

  • Bilingual dictionary induction
  • Constraint satisfaction problem
  • Low-resource languages
  • Pivot language
  • Weighted partial max-SAT

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'A constraint approach to lexicon induction for low-resource languages'. Together they form a unique fingerprint.

  • Cite this

    Wushouer, M., Lin, D., Ishida, T., & Murakami, Y. (2018). A constraint approach to lexicon induction for low-resource languages. In Cognitive Technologies (9789811077920 ed., pp. 109-123). (Cognitive Technologies; No. 9789811077920). Springer-Verlag. https://doi.org/10.1007/978-981-10-7793-7_7