Extracting key phrases to disambiguate personal names on the web

Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka

研究成果: Conference contribution

5 被引用数 (Scopus)

抄録

When you search for information regarding a particular person on the web, a search engine returns many pages. Some of these pages may be for people with the same name. How can we disambiguate these different people with the same name? This paper presents an unsupervised algorithm which produces key phrases for the different people with the same name. These key phrases could be used to further narrow down the search, leading to more person specific unambiguous information. The algorithm we propose does not require any biographical or social information regarding the person. Although there are some previous work in personal name disambiguation on the web, to our knowledge, this is the first attempt to extract key phrases to disambiguate the different persons with the same name. To evaluate our algorithm, we collected and hand labeled a dataset of over 1000 Web pages retrieved from Google using personal name queries. Our experimental results shows an improvement over the existing methods for namesake disambiguation.

本文言語English
ホスト出版物のタイトルLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ページ223-234
ページ数12
3878 LNCS
DOI
出版ステータスPublished - 2006
外部発表はい
イベント7th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2006 - Mexico City
継続期間: 2006 2 192006 2 25

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
3878 LNCS
ISSN(印刷版)03029743
ISSN(電子版)16113349

Other

Other7th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2006
CityMexico City
Period06/2/1906/2/25

ASJC Scopus subject areas

  • コンピュータ サイエンス(全般)
  • 生化学、遺伝学、分子生物学(全般)
  • 理論的コンピュータサイエンス

フィンガープリント

「Extracting key phrases to disambiguate personal names on the web」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル