Plan optimization for creating bilingual dictionaries of low-resource languages

Arbi Haza Nasution*, Yohei Murakami, Toru Ishida

*この研究の対応する著者

研究成果

8 被引用数 (Scopus)

抄録

The constraint-based approach has been proven useful for inducing bilingual lexicons for closely-related low-resource languages. When we want to create multiple bilingual dictionaries linking several languages, we need to consider manual creation by bilingual language experts if there are no available machine-readable dictionaries are available as input. To overcome the difficulty in planning the creation of bilingual dictionaries, the consideration of various methods and costs, plan optimization is essential. We adopt the Markov Decision Process (MDP) in formalizing plan optimization for creating bilingual dictionaries; the goal is to better predict the most feasible optimal plan with the least total cost before fully implementing the constraint-based bilingual dictionary induction framework. We define heuristics based on input language characteristics to devise a baseline plan for evaluating our MDP-based approach with total cost as an evaluation metric. The MDP-based proposal outperformed heuristic planning on total cost for all datasets examined.

本文言語English
ホスト出版物のタイトルProceedings - 2017 International Conference on Culture and Computing, Culture and Computing 2017
出版社Institute of Electrical and Electronics Engineers Inc.
ページ35-41
ページ数7
ISBN(電子版)9781538611357
DOI
出版ステータスPublished - 2017 12 19
外部発表はい
イベント2017 International Conference on Culture and Computing, Culture and Computing 2017 - Kyoto, Japan
継続期間: 2017 9 102017 9 12

出版物シリーズ

名前Proceedings - 2017 International Conference on Culture and Computing, Culture and Computing 2017
2017-December

Other

Other2017 International Conference on Culture and Computing, Culture and Computing 2017
国/地域Japan
CityKyoto
Period17/9/1017/9/12

ASJC Scopus subject areas

  • メディア記述
  • カルチュラル スタディーズ
  • コンピュータ サイエンスの応用
  • 人間とコンピュータの相互作用

フィンガープリント

「Plan optimization for creating bilingual dictionaries of low-resource languages」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル