TY - GEN
T1 - Improving the distribution of N-grams in phrase tables obtained by the sampling-based method
AU - Luo, Juan
AU - Lardilleux, Adrien
AU - Lepage, Yves
PY - 2014/1/1
Y1 - 2014/1/1
N2 - We describe an approach to improve the performance of sampling-based sub-sentential alignment method on translation tasks by investigating the distribution of n-grams in the phrase tables. This approach consists in enforcing the alignment of n-grams. We compare the quality of phrase translation tables output by this approach and that of the state-of-the-art estimation approach in statistical machine translation tasks. We report significant improvements for this approach and show that merging phrase tables outperforms the state-of-the-art techniques.
AB - We describe an approach to improve the performance of sampling-based sub-sentential alignment method on translation tasks by investigating the distribution of n-grams in the phrase tables. This approach consists in enforcing the alignment of n-grams. We compare the quality of phrase translation tables output by this approach and that of the state-of-the-art estimation approach in statistical machine translation tasks. We report significant improvements for this approach and show that merging phrase tables outperforms the state-of-the-art techniques.
KW - Statistical machine translation
KW - Sub-sentential alignment
UR - http://www.scopus.com/inward/record.url?scp=84905855861&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905855861&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-08958-4_34
DO - 10.1007/978-3-319-08958-4_34
M3 - Conference contribution
AN - SCOPUS:84905855861
SN - 9783319089577
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 419
EP - 431
BT - Human Language Technology Challenges for Computer Science and Linguistics - 5th Language and Technology Conference, LTC 2011, Revised Selected Papers
PB - Springer Verlag
T2 - 5th Language and Technology Conference, LTC 2011
Y2 - 25 November 2011 through 27 November 2011
ER -