Yet another symmetrical & real-time word alignment method: Hierarchical sub-sentential alignment using F-measure

Hao Wang, Yves Lepage

研究成果: Conference contribution

抄録

Symmetrization of word alignments is the fundamental issue in statistical machine translation (SMT). In this paper, we describe an novel reformulation of Hierarchical Subsentential Alignment (HSSA) method using F-measure. Starting with a soft alignment matrix, we use the F-measure to recursively split ENGL the matrix into two soft alignment submatrices. A direction is chosen as the same time on the basis of Inversion Transduction Grammar (ITG). In other words, our method simplifies the processing of word alignment as recursive segmentation in a bipartite graph, which is simple and easy to implement. It can be considered as an alternative of growdiag- final-and heuristic. We show its application on phrase-based SMT systems combined with the state-of-the-art approaches. In addition, by feeding with word-to-word associations, it also can be a real-time word aligner. Our experiments show that, given a reliable lexicon translation table, this simple method can yield comparable results with state-of-theart approaches.

本文言語English
ホスト出版物のタイトルProceedings of the 30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016
出版社Institute for the Study of Language and Information
ページ143-152
ページ数10
ISBN(電子版)9788968174285
出版ステータスPublished - 2016
イベント30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016 - Seoul, Korea, Republic of
継続期間: 2016 10 282016 10 30

Other

Other30th Pacific Asia Conference on Language, Information and Computation, PACLIC 2016
CountryKorea, Republic of
CitySeoul
Period16/10/2816/10/30

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science (miscellaneous)
  • Information Systems

フィンガープリント 「Yet another symmetrical & real-time word alignment method: Hierarchical sub-sentential alignment using F-measure」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル