Translation of unseen bigrams by analogy using an SVM classifier

Hao Wang, Lu Lyu, Yves Lepage

Research output: Contribution to conferencePaperpeer-review

Abstract

Detecting language divergences and predicting possible sub-translations is one of the most essential issues in machine translation. Since the existence of translation divergences, it is impractical to straightforward translate from source sentence into target sentence while keeping the high degree of accuracy and without additional information. In this paper, we investigate the problem from an emerging and special point of view: bigrams and the corresponding translations. We first profile corpora and explore the constituents of bigrams in the source language. Then we translate unseen bigrams based on proportional analogy and filter the outputs using an Support Vector Machine (SVM) classifier. The experiment results also show that even a small set of features from analogous can provide meaningful information in translating by analogy.

Original languageEnglish
Pages16-25
Number of pages10
Publication statusPublished - 2015
Event29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 - Shanghai, China
Duration: 2015 Oct 302015 Nov 1

Other

Other29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015
Country/TerritoryChina
CityShanghai
Period15/10/3015/11/1

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Translation of unseen bigrams by analogy using an SVM classifier'. Together they form a unique fingerprint.

Cite this