MT-based Japanese-English cross-language IR experiments using the TREC test collections

Tetsuya Sakai*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

This paper evaluates the effectiveness of MT-based Japanese-English CLIR using a subcollection of the TREC test collections and two bilingual researchers to separately translate the TREC requests into Japanese. Our main findings are as follows: (1)With the aid of pseudo-relevance feedback, MT-based J-E CLIR can be as effective as "best-case" monolingual IR. In particular, although poor MT quality often leads to poor initial CLIR performance, pseudo-relevance feedback is useful for alleviating the harm; (2)The manual request translation process that is inherent in conventional CLIR performance evaluation can be a more dominant factor than query length and overall MT quality.

Original languageEnglish
Title of host publicationProceedings of the 5th international Workshop on Information Retrieval with Asian Languages, IRAL 2000
PublisherAssociation for Computing Machinery, Inc
Pages181-188
Number of pages8
ISBN (Electronic)1581133006, 9781581133004
DOIs
Publication statusPublished - 2000 Nov 1
Externally publishedYes
Event5th International Workshop on Information Retrieval with Asian Languages, IRAL 2000 - Hong Kong, China
Duration: 2000 Sept 302000 Oct 1

Publication series

NameProceedings of the 5th international Workshop on Information Retrieval with Asian Languages, IRAL 2000

Other

Other5th International Workshop on Information Retrieval with Asian Languages, IRAL 2000
Country/TerritoryChina
CityHong Kong
Period00/9/3000/10/1

Keywords

  • Cross-language information retrieval
  • Machine translation

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems

Fingerprint

Dive into the research topics of 'MT-based Japanese-English cross-language IR experiments using the TREC test collections'. Together they form a unique fingerprint.

Cite this