HSSA tree structures for BTG-based preordering in machine translation

Yujia Zhang, Hao Wang, Yves Lepage

研究成果: Conference contribution

抄録

The Hierarchical Sub-Sentential Alignment (HSSA) method is a method to obtain aligned binary tree structures for two aligned sentences in translation correspondence. We propose to use the binary aligned tree structures delivered by this method as training data for preordering prior to machine translation. For that, we learn a Bracketing Transduction Grammar (BTG) from these binary aligned tree structures. In two oracle experiments in English to Japanese and Japanese to English translation, we show that it is theoretically possible to outperform a baseline system with a default distortion limit of 6, by about 2.5 and 5 BLEU points and, 7 and 10 RIBES points respectively, when preordering the source sentences using the learnt preordering model and using a distortion limit of 0. An attempt at learning a preordering model and its results are also reported.

本文言語English
ホスト出版物のタイトルProceedings of the 30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016
出版社Institute for the Study of Language and Information
ページ123-132
ページ数10
ISBN(電子版)9788968174285
出版ステータスPublished - 2016
イベント30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016 - Seoul, Korea, Republic of
継続期間: 2016 10 282016 10 30

Other

Other30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016
CountryKorea, Republic of
CitySeoul
Period16/10/2816/10/30

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science (miscellaneous)
  • Information Systems

フィンガープリント 「HSSA tree structures for BTG-based preordering in machine translation」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル