Summarization of dynamic content in web collections

Adam Jatowt*, Mitsuru Ishizuka

*この研究の対応する著者

研究成果: Article査読

7 被引用数 (Scopus)

抄録

This paper describes a new research proposal of multi-document summarization of dynamic content in web pages. Much information is lost in the Web due to the temporal character of web documents. Therefore adapting summarization techniques to the web genre is a promising task. The aim of our research is to provide methods for summarizing volatile content retrieved from collections of topically related web pages over defined time periods. The resulting summary ideally would reflect the most popular topics and concepts found in retrospective web collections. Because of the content and time diversities of web changes, it is necessary to apply different techniques than standard methods used for static documents. In this paper we propose an initial solution to this summarization problem. Our approach exploits temporal similarities between web pages by utilizing sliding window concept over dynamic parts of the collection.

本文言語English
ページ(範囲)245-254
ページ数10
ジャーナルLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
3202
出版ステータスPublished - 2004
外部発表はい

ASJC Scopus subject areas

  • コンピュータ サイエンス(全般)
  • 生化学、遺伝学、分子生物学(全般)
  • 理論的コンピュータサイエンス

フィンガープリント

「Summarization of dynamic content in web collections」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル