Automatic creation of N-lingual synonymous word sets

Wu Yanchen*, Li Fang, Rie Tanaka, Toru Ishida

*この研究の対応する著者

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

Multilingual dictionaries are very useful in machine translations and natural language processing. However, a multilingual dictionary including all natural languages still does not exist. In this paper we propose a trustworthy method to automatically create multilingual dictionary represented by N-lingual synonymous word sets (N-tuples, hereafter). Based on the work of 3-lingual synonymous word sets, our method has extended 3-lingual to n-lingual synonymous word sets from multiple bilingual dictionaries. By matching and combining the triples instead of the binary relations in the bilingual dictionaries, the complexity of the problem is significantly reduced. Using this method, we created 4-lingual synonymous word sets among Chinese, Japanese, English and German. The evaluations indicate that our combining algorithm has effectively solved the error accumulation problem and achieved a very promising quality. In the example application, the 4-tuples are used to refine the translation quality of a multi-hop machine translator created on the Language Grid. It shows that utilizing the handy online services and uniform platform in research work is a good methodology.

本文言語English
ホスト出版物のタイトルProceedings of the 4th International Conference on Semantics, Knowledge, and Grid, SKG 2008
ページ141-148
ページ数8
DOI
出版ステータスPublished - 2008
外部発表はい
イベント4th International Conference on Semantics, Knowledge, and Grid, SKG 2008 - Beijing, China
継続期間: 2008 12月 32008 12月 5

出版物シリーズ

名前Proceedings of the 4th International Conference on Semantics, Knowledge, and Grid, SKG 2008

Conference

Conference4th International Conference on Semantics, Knowledge, and Grid, SKG 2008
国/地域China
CityBeijing
Period08/12/308/12/5

ASJC Scopus subject areas

  • 人工知能
  • コンピュータ サイエンスの応用
  • ソフトウェア

フィンガープリント

「Automatic creation of N-lingual synonymous word sets」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル