Covering a sentence in form and meaning with fewer retrieved sentences

Yuan Liu, Yves Lepage

Research output: Contribution to conferencePaperpeer-review

Abstract

Retrieving similar sentences from a given collection of sentences is essential in a range of applications. In this work, we propose a novel method to retrieve several sentences that cover an input sentence in form and meaning with minimal redundancy, so as to enhance the overall coverage quality of the output sentences. We focus on the hierarchical granularity levels of sentence pieces, matching from common or similar n-grams to finer-grained words o subwords, using techniques from similar sentence retrieval and monolingual phrase alignment. Our method shows promising source and target coverage evaluation results when applied to parallel corpora. This shows the potential of our approach if integrated into an example-based machine translation system.

Original languageEnglish
Pages436-445
Number of pages10
Publication statusPublished - 2021
Event35th Pacific Asia Conference on Language, Information and Computation, PACLIC 2021 - Shanghai, China
Duration: 2021 Nov 52021 Nov 7

Conference

Conference35th Pacific Asia Conference on Language, Information and Computation, PACLIC 2021
Country/TerritoryChina
CityShanghai
Period21/11/521/11/7

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Covering a sentence in form and meaning with fewer retrieved sentences'. Together they form a unique fingerprint.

Cite this