Marker-based chunking in eleven European languages for analogy-based translation

Kota Takeya, Yves Lepage

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

An example-based machine translation (EBMT) system based on proportional analogies requires numerous proportional analogies between linguistic units to work properly. Consequently, long sentences cannot be handled directly in such a framework. Cutting sentences into chunks would be a solution. Using different markers, we count the number of proportional analogies between chunks in 11 European languages. As expected, the number of proportional analogies between chunks found is very high. These results, and preliminary experiments in translation, are promising for the EBMT system that we intend to build.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages432-444
Number of pages13
Volume8387 LNAI
ISBN (Print)9783319089577
DOIs
Publication statusPublished - 2014
Event5th Language and Technology Conference, LTC 2011 - Poznan
Duration: 2011 Nov 252011 Nov 27

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8387 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other5th Language and Technology Conference, LTC 2011
CityPoznan
Period11/11/2511/11/27

Fingerprint

Linguistics
Analogy
Directly proportional
Machine Translation
Experiments
Count
Unit
Language
Experiment

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Takeya, K., & Lepage, Y. (2014). Marker-based chunking in eleven European languages for analogy-based translation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8387 LNAI, pp. 432-444). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8387 LNAI). Springer Verlag. https://doi.org/10.1007/978-3-319-08958-4_35

Marker-based chunking in eleven European languages for analogy-based translation. / Takeya, Kota; Lepage, Yves.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 8387 LNAI Springer Verlag, 2014. p. 432-444 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8387 LNAI).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Takeya, K & Lepage, Y 2014, Marker-based chunking in eleven European languages for analogy-based translation. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 8387 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 8387 LNAI, Springer Verlag, pp. 432-444, 5th Language and Technology Conference, LTC 2011, Poznan, 11/11/25. https://doi.org/10.1007/978-3-319-08958-4_35
Takeya K, Lepage Y. Marker-based chunking in eleven European languages for analogy-based translation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 8387 LNAI. Springer Verlag. 2014. p. 432-444. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-08958-4_35
Takeya, Kota ; Lepage, Yves. / Marker-based chunking in eleven European languages for analogy-based translation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 8387 LNAI Springer Verlag, 2014. pp. 432-444 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{8069ed4b19de4d81b21f98eb5c5ca77d,
title = "Marker-based chunking in eleven European languages for analogy-based translation",
abstract = "An example-based machine translation (EBMT) system based on proportional analogies requires numerous proportional analogies between linguistic units to work properly. Consequently, long sentences cannot be handled directly in such a framework. Cutting sentences into chunks would be a solution. Using different markers, we count the number of proportional analogies between chunks in 11 European languages. As expected, the number of proportional analogies between chunks found is very high. These results, and preliminary experiments in translation, are promising for the EBMT system that we intend to build.",
author = "Kota Takeya and Yves Lepage",
year = "2014",
doi = "10.1007/978-3-319-08958-4_35",
language = "English",
isbn = "9783319089577",
volume = "8387 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "432--444",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Marker-based chunking in eleven European languages for analogy-based translation

AU - Takeya, Kota

AU - Lepage, Yves

PY - 2014

Y1 - 2014

N2 - An example-based machine translation (EBMT) system based on proportional analogies requires numerous proportional analogies between linguistic units to work properly. Consequently, long sentences cannot be handled directly in such a framework. Cutting sentences into chunks would be a solution. Using different markers, we count the number of proportional analogies between chunks in 11 European languages. As expected, the number of proportional analogies between chunks found is very high. These results, and preliminary experiments in translation, are promising for the EBMT system that we intend to build.

AB - An example-based machine translation (EBMT) system based on proportional analogies requires numerous proportional analogies between linguistic units to work properly. Consequently, long sentences cannot be handled directly in such a framework. Cutting sentences into chunks would be a solution. Using different markers, we count the number of proportional analogies between chunks in 11 European languages. As expected, the number of proportional analogies between chunks found is very high. These results, and preliminary experiments in translation, are promising for the EBMT system that we intend to build.

UR - http://www.scopus.com/inward/record.url?scp=84905821718&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84905821718&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-08958-4_35

DO - 10.1007/978-3-319-08958-4_35

M3 - Conference contribution

SN - 9783319089577

VL - 8387 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 432

EP - 444

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -