Extracting the author of web pages

Yoshikiyo Kato, Daisuke Kawahara, Kentaro Inui, Sadao Kurohashi, Tomohide Shibata

研究成果: Conference contribution

3 被引用数 (Scopus)

抄録

In this paper, we define the problem of identifying the author of a Web page as a sub-problem of identifying the information sender configuration of a Web page. We propose a method that extracts the author name candidates from a Web page based on linguistic features, and rank the candidates based on local features such as distance from the main content. The evaluation shows that we can achieve more than 75% precision when evaluated with candidates ranked within top five.

本文言語English
ホスト出版物のタイトルProceedings of the 2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08
ページ35-41
ページ数7
DOI
出版ステータスPublished - 2008
外部発表はい
イベント2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08 - Napa Valley, CA, United States
継続期間: 2008 10 262008 10 30

出版物シリーズ

名前International Conference on Information and Knowledge Management, Proceedings

Conference

Conference2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08
CountryUnited States
CityNapa Valley, CA
Period08/10/2608/10/30

ASJC Scopus subject areas

  • Decision Sciences(all)
  • Business, Management and Accounting(all)

フィンガープリント 「Extracting the author of web pages」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル