A method of generating translations of unseen n-grams by using proportional analogy

Juan Luo, Yves Lepage

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

In recent years, statistical machine translation has gained much attention. The phrase-based statistical machine translation model has made significant advancement in translation quality over the word-based model. In this paper, we attempt to apply the technique of proportional analogy to statistical machine translation systems. We propose a novel approach to apply proportional analogy to generate translations of unseen n-grams from the phrase table for phrase-based statistical machine translation. Experiments are conducted with two datasets of different sizes. We also investigate two methods to integrate n-grams translations produced by proportional analogy into the state-of-the-art statistical machine translation system, Moses.1 The experimental results show that unseen n-grams translations generated using the technique of proportional analogy are rewarding for statistical machine translation systems with small datasets.

Original languageEnglish
JournalIEEJ Transactions on Electrical and Electronic Engineering
DOIs
Publication statusAccepted/In press - 2016

    Fingerprint

Keywords

  • Phrase table
  • Proportional analogy
  • Statistical machine translation
  • Unseen n-grams

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this