A bottom-up approach to sentence ordering for multi-document summarization

Danushka Bollegala, Naoaki Okazaki, Mitsuru Ishizuka

研究成果: Article査読

35 被引用数 (Scopus)

抄録

Ordering information is a difficult but important task for applications generating natural language texts such as multi-document summarization, question answering, and concept-to-text generation. In multi-document summarization, information is selected from a set of source documents. However, improper ordering of information in a summary can confuse the reader and deteriorate the readability of the summary. Therefore, it is vital to properly order the information in multi-document summarization. We present a bottom-up approach to arrange sentences extracted for multi-document summarization. To capture the association and order of two textual segments (e.g. sentences), we define four criteria: chronology, topical-closeness, precedence, and succession. These criteria are integrated into a criterion by a supervised learning approach. We repeatedly concatenate two textual segments into one segment based on the criterion, until we obtain the overall segment with all sentences arranged. We evaluate the sentence orderings produced by the proposed method and numerous baselines using subjective gradings as well as automatic evaluation measures. We introduce the average continuity, an automatic evaluation measure of sentence ordering in a summary, and investigate its appropriateness for this task.

本文言語English
ページ(範囲)89-109
ページ数21
ジャーナルInformation Processing and Management
46
1
DOI
出版ステータスPublished - 2010 1
外部発表はい

ASJC Scopus subject areas

  • メディア記述
  • 情報システム
  • コンピュータ サイエンスの応用
  • 図書館情報学
  • 経営科学およびオペレーションズ リサーチ

フィンガープリント

「A bottom-up approach to sentence ordering for multi-document summarization」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル