Plan optimization for creating bilingual dictionaries of low-resource languages

Arbi Haza Nasution, Yohei Murakami, Toru Ishida

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

The constraint-based approach has been proven useful for inducing bilingual lexicons for closely-related low-resource languages. When we want to create multiple bilingual dictionaries linking several languages, we need to consider manual creation by bilingual language experts if there are no available machine-readable dictionaries are available as input. To overcome the difficulty in planning the creation of bilingual dictionaries, the consideration of various methods and costs, plan optimization is essential. We adopt the Markov Decision Process (MDP) in formalizing plan optimization for creating bilingual dictionaries; the goal is to better predict the most feasible optimal plan with the least total cost before fully implementing the constraint-based bilingual dictionary induction framework. We define heuristics based on input language characteristics to devise a baseline plan for evaluating our MDP-based approach with total cost as an evaluation metric. The MDP-based proposal outperformed heuristic planning on total cost for all datasets examined.

Original languageEnglish
Title of host publicationProceedings - 2017 International Conference on Culture and Computing, Culture and Computing 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages35-41
Number of pages7
ISBN (Electronic)9781538611357
DOIs
Publication statusPublished - 2017 Dec 19
Externally publishedYes
Event2017 International Conference on Culture and Computing, Culture and Computing 2017 - Kyoto, Japan
Duration: 2017 Sep 102017 Sep 12

Publication series

NameProceedings - 2017 International Conference on Culture and Computing, Culture and Computing 2017
Volume2017-December

Other

Other2017 International Conference on Culture and Computing, Culture and Computing 2017
CountryJapan
CityKyoto
Period17/9/1017/9/12

    Fingerprint

Keywords

  • Closely-related Languages
  • Low-resource Languages
  • Markov Decision Process
  • Pivot-based Bilingual Dictionary Induction
  • Plan Optimization

ASJC Scopus subject areas

  • Media Technology
  • Cultural Studies
  • Computer Science Applications
  • Human-Computer Interaction

Cite this

Nasution, A. H., Murakami, Y., & Ishida, T. (2017). Plan optimization for creating bilingual dictionaries of low-resource languages. In Proceedings - 2017 International Conference on Culture and Computing, Culture and Computing 2017 (pp. 35-41). (Proceedings - 2017 International Conference on Culture and Computing, Culture and Computing 2017; Vol. 2017-December). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/Culture.and.Computing.2017.21