Translation of unseen bigrams by analogy using an SVM classifier

Hao Wang, Lu Lyu, Yves Lepage

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Detecting language divergences and predicting possible sub-translations is one of the most essential issues in machine translation. Since the existence of translation divergences, it is impractical to straightforward translate from source sentence into target sentence while keeping the high degree of accuracy and without additional information. In this paper, we investigate the problem from an emerging and special point of view: bigrams and the corresponding translations. We first profile corpora and explore the constituents of bigrams in the source language. Then we translate unseen bigrams based on proportional analogy and filter the outputs using an Support Vector Machine (SVM) classifier. The experiment results also show that even a small set of features from analogous can provide meaningful information in translating by analogy.

Original languageEnglish
Title of host publication29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015
PublisherShanghai Jiao Tong University
Pages16-25
Number of pages10
Publication statusPublished - 2015
Event29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 - Shanghai, China
Duration: 2015 Oct 302015 Nov 1

Other

Other29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015
CountryChina
CityShanghai
Period15/10/3015/11/1

Fingerprint

Support vector machines
Classifiers
divergence
Experiments
language
experiment

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Linguistics and Language

Cite this

Wang, H., Lyu, L., & Lepage, Y. (2015). Translation of unseen bigrams by analogy using an SVM classifier. In 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 (pp. 16-25). Shanghai Jiao Tong University.

Translation of unseen bigrams by analogy using an SVM classifier. / Wang, Hao; Lyu, Lu; Lepage, Yves.

29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015. Shanghai Jiao Tong University, 2015. p. 16-25.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wang, H, Lyu, L & Lepage, Y 2015, Translation of unseen bigrams by analogy using an SVM classifier. in 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015. Shanghai Jiao Tong University, pp. 16-25, 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015, Shanghai, China, 15/10/30.
Wang H, Lyu L, Lepage Y. Translation of unseen bigrams by analogy using an SVM classifier. In 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015. Shanghai Jiao Tong University. 2015. p. 16-25
Wang, Hao ; Lyu, Lu ; Lepage, Yves. / Translation of unseen bigrams by analogy using an SVM classifier. 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015. Shanghai Jiao Tong University, 2015. pp. 16-25
@inproceedings{906d42e874494a4fb84878bc216d96e7,
title = "Translation of unseen bigrams by analogy using an SVM classifier",
abstract = "Detecting language divergences and predicting possible sub-translations is one of the most essential issues in machine translation. Since the existence of translation divergences, it is impractical to straightforward translate from source sentence into target sentence while keeping the high degree of accuracy and without additional information. In this paper, we investigate the problem from an emerging and special point of view: bigrams and the corresponding translations. We first profile corpora and explore the constituents of bigrams in the source language. Then we translate unseen bigrams based on proportional analogy and filter the outputs using an Support Vector Machine (SVM) classifier. The experiment results also show that even a small set of features from analogous can provide meaningful information in translating by analogy.",
author = "Hao Wang and Lu Lyu and Yves Lepage",
year = "2015",
language = "English",
pages = "16--25",
booktitle = "29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015",
publisher = "Shanghai Jiao Tong University",

}

TY - GEN

T1 - Translation of unseen bigrams by analogy using an SVM classifier

AU - Wang, Hao

AU - Lyu, Lu

AU - Lepage, Yves

PY - 2015

Y1 - 2015

N2 - Detecting language divergences and predicting possible sub-translations is one of the most essential issues in machine translation. Since the existence of translation divergences, it is impractical to straightforward translate from source sentence into target sentence while keeping the high degree of accuracy and without additional information. In this paper, we investigate the problem from an emerging and special point of view: bigrams and the corresponding translations. We first profile corpora and explore the constituents of bigrams in the source language. Then we translate unseen bigrams based on proportional analogy and filter the outputs using an Support Vector Machine (SVM) classifier. The experiment results also show that even a small set of features from analogous can provide meaningful information in translating by analogy.

AB - Detecting language divergences and predicting possible sub-translations is one of the most essential issues in machine translation. Since the existence of translation divergences, it is impractical to straightforward translate from source sentence into target sentence while keeping the high degree of accuracy and without additional information. In this paper, we investigate the problem from an emerging and special point of view: bigrams and the corresponding translations. We first profile corpora and explore the constituents of bigrams in the source language. Then we translate unseen bigrams based on proportional analogy and filter the outputs using an Support Vector Machine (SVM) classifier. The experiment results also show that even a small set of features from analogous can provide meaningful information in translating by analogy.

UR - http://www.scopus.com/inward/record.url?scp=84967184852&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84967184852&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84967184852

SP - 16

EP - 25

BT - 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015

PB - Shanghai Jiao Tong University

ER -