A heuristic framework for pivot-based bilingual dictionary induction

Mairidan Wushouer, Toru Ishida, Donghui Lin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

High quality machine readable dictionaries are very useful, but such resources are rarely available for lower-density language pairs, especially for those that are closely related. In this paper, we proposed a heuristic framework that aims at inducing one-to-one mapping dictionary of a closely related language pair from available dictionaries where a distant language is involved. The key insight of the framework is the ability to create heuristics by using distant language as pivot, incorporate given heuristics, and an iterative induction mechanism that human interaction can be potentially integrated. An experiment based on basic heuristics regarding syntactics and semantics resulted in up to 85.2% correctness in target dictionary with correctness of major part reached 95.3%, which proved that we can perform automated creation of a high quality dictionary with our framework.

Original languageEnglish
Title of host publicationProceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013
PublisherIEEE Computer Society
Pages111-116
Number of pages6
ISBN (Print)9780769550473
DOIs
Publication statusPublished - 2013 Jan 1
Externally publishedYes
Event2013 International Conference on Culture and Computing, Culture and Computing 2013 - Kyoto, Japan
Duration: 2013 Sep 162013 Sep 18

Publication series

NameProceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013

Conference

Conference2013 International Conference on Culture and Computing, Culture and Computing 2013
CountryJapan
CityKyoto
Period13/9/1613/9/18

Fingerprint

Glossaries
Syntactics
Semantics
Experiments

Keywords

  • Dictionary induction
  • Heuristics
  • Iterative framework
  • Pivot language

ASJC Scopus subject areas

  • Software

Cite this

Wushouer, M., Ishida, T., & Lin, D. (2013). A heuristic framework for pivot-based bilingual dictionary induction. In Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013 (pp. 111-116). [6680340] (Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013). IEEE Computer Society. https://doi.org/10.1109/CultureComputing.2013.27

A heuristic framework for pivot-based bilingual dictionary induction. / Wushouer, Mairidan; Ishida, Toru; Lin, Donghui.

Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013. IEEE Computer Society, 2013. p. 111-116 6680340 (Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wushouer, M, Ishida, T & Lin, D 2013, A heuristic framework for pivot-based bilingual dictionary induction. in Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013., 6680340, Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013, IEEE Computer Society, pp. 111-116, 2013 International Conference on Culture and Computing, Culture and Computing 2013, Kyoto, Japan, 13/9/16. https://doi.org/10.1109/CultureComputing.2013.27
Wushouer M, Ishida T, Lin D. A heuristic framework for pivot-based bilingual dictionary induction. In Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013. IEEE Computer Society. 2013. p. 111-116. 6680340. (Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013). https://doi.org/10.1109/CultureComputing.2013.27
Wushouer, Mairidan ; Ishida, Toru ; Lin, Donghui. / A heuristic framework for pivot-based bilingual dictionary induction. Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013. IEEE Computer Society, 2013. pp. 111-116 (Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013).
@inproceedings{f75f2ee89a9b474cb630a35a8aeb64dc,
title = "A heuristic framework for pivot-based bilingual dictionary induction",
abstract = "High quality machine readable dictionaries are very useful, but such resources are rarely available for lower-density language pairs, especially for those that are closely related. In this paper, we proposed a heuristic framework that aims at inducing one-to-one mapping dictionary of a closely related language pair from available dictionaries where a distant language is involved. The key insight of the framework is the ability to create heuristics by using distant language as pivot, incorporate given heuristics, and an iterative induction mechanism that human interaction can be potentially integrated. An experiment based on basic heuristics regarding syntactics and semantics resulted in up to 85.2{\%} correctness in target dictionary with correctness of major part reached 95.3{\%}, which proved that we can perform automated creation of a high quality dictionary with our framework.",
keywords = "Dictionary induction, Heuristics, Iterative framework, Pivot language",
author = "Mairidan Wushouer and Toru Ishida and Donghui Lin",
year = "2013",
month = "1",
day = "1",
doi = "10.1109/CultureComputing.2013.27",
language = "English",
isbn = "9780769550473",
series = "Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013",
publisher = "IEEE Computer Society",
pages = "111--116",
booktitle = "Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013",

}

TY - GEN

T1 - A heuristic framework for pivot-based bilingual dictionary induction

AU - Wushouer, Mairidan

AU - Ishida, Toru

AU - Lin, Donghui

PY - 2013/1/1

Y1 - 2013/1/1

N2 - High quality machine readable dictionaries are very useful, but such resources are rarely available for lower-density language pairs, especially for those that are closely related. In this paper, we proposed a heuristic framework that aims at inducing one-to-one mapping dictionary of a closely related language pair from available dictionaries where a distant language is involved. The key insight of the framework is the ability to create heuristics by using distant language as pivot, incorporate given heuristics, and an iterative induction mechanism that human interaction can be potentially integrated. An experiment based on basic heuristics regarding syntactics and semantics resulted in up to 85.2% correctness in target dictionary with correctness of major part reached 95.3%, which proved that we can perform automated creation of a high quality dictionary with our framework.

AB - High quality machine readable dictionaries are very useful, but such resources are rarely available for lower-density language pairs, especially for those that are closely related. In this paper, we proposed a heuristic framework that aims at inducing one-to-one mapping dictionary of a closely related language pair from available dictionaries where a distant language is involved. The key insight of the framework is the ability to create heuristics by using distant language as pivot, incorporate given heuristics, and an iterative induction mechanism that human interaction can be potentially integrated. An experiment based on basic heuristics regarding syntactics and semantics resulted in up to 85.2% correctness in target dictionary with correctness of major part reached 95.3%, which proved that we can perform automated creation of a high quality dictionary with our framework.

KW - Dictionary induction

KW - Heuristics

KW - Iterative framework

KW - Pivot language

UR - http://www.scopus.com/inward/record.url?scp=84893252484&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84893252484&partnerID=8YFLogxK

U2 - 10.1109/CultureComputing.2013.27

DO - 10.1109/CultureComputing.2013.27

M3 - Conference contribution

AN - SCOPUS:84893252484

SN - 9780769550473

T3 - Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013

SP - 111

EP - 116

BT - Proceedings - 2013 International Conference on Culture and Computing, Culture and Computing 2013

PB - IEEE Computer Society

ER -