Finding High Quality Documents through Link and Click Graphs

Linfeng Yu, Mizuho Iwaihara

研究成果

1 被引用数 (Scopus)

抄録

Link graphs of web pages have been utilized to evaluate importance of each page. Existing link analysis algorithms, including HITS and PageRank, exploit static link connectivity between pages. On the other hand, service providers often record HTTP requests that contain the resource and referrer of each request, from which we can construct a click graph that has edge weights representing the times of clicks on each link, or link traffic. Click graphs reflect users' choices of interesting links, thus the graphs are useful for evaluating importance of pages. However, clicks are often skewed onto highly popular links, so that click graphs only could not properly evaluate less clicked pages. In this paper, we propose an algorithm called click count-weighted HITS algorithm, which integrates HITS algorithm with click graphs, for finding high quality documents. Our evaluations on finding featured articles of English Wikipedia show that our click count-weighted HITS algorithm shows better performance on a large Wikipedia corpus than algorithms that utilize link graphs or click graphs only.

本文言語English
ホスト出版物のタイトルProceedings - 2018 7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018
出版社Institute of Electrical and Electronics Engineers Inc.
ページ49-54
ページ数6
ISBN(電子版)9781538674475
DOI
出版ステータスPublished - 2019 4 16
イベント7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018 - Yonago, Japan
継続期間: 2018 7 82018 7 13

出版物シリーズ

名前Proceedings - 2018 7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018

Conference

Conference7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018
国/地域Japan
CityYonago
Period18/7/818/7/13

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信
  • 通信
  • 情報システム
  • 情報システムおよび情報管理
  • 教育

フィンガープリント

「Finding High Quality Documents through Link and Click Graphs」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル