Computing cross-lingual synonym set similarity by using princeton annotated gloss corpus

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper proposes a method to compute cross-lingual semantic similarity between synonym sets. By making use of Princeton Annotated Gloss Corpus as the source of target language statistics, the proposed method exhibited promising results in the experiments: More than 73% of the Princeton WordNet synsets were successfully recovered within the top-5 candidates, given a corresponding set of Japanese WordNet synsets. As the proposed method minimally requires that the input to be seen as an apparently synonymous word set, the method could be extended and the performance would be further improved by incorporating richer information such as textual glosses and/or structural constraints posed by the lexical resources at hand.

Original languageEnglish
Title of host publicationGWC 2012: 6th International Global Wordnet Conference, Proceedings
PublisherTribun EU s. r. o.
Pages134-141
Number of pages8
ISBN (Print)9788026302445
Publication statusPublished - 2012
Externally publishedYes
Event6th International Global Wordnet Conference, GWC 2012 - Matsue, Japan
Duration: 2012 Jan 92012 Jan 13

Other

Other6th International Global Wordnet Conference, GWC 2012
CountryJapan
CityMatsue
Period12/1/912/1/13

    Fingerprint

ASJC Scopus subject areas

  • Language and Linguistics
  • Literature and Literary Theory

Cite this

Hayashi, Y. (2012). Computing cross-lingual synonym set similarity by using princeton annotated gloss corpus. In GWC 2012: 6th International Global Wordnet Conference, Proceedings (pp. 134-141). Tribun EU s. r. o..