Low-cost, bottom-up measures for evaluating search result diversification

Zhicheng Dou*, Xue Yang, Diya Li, Ji Rong Wen, Tetsuya Sakai

*Corresponding author for this work

Research output: Article, peer-reviewed

2 Citations (Scopus)

Abstract

Search result diversification aims at covering different user intents by returning a diversified document list. Most existing diversity measures require a predefined set of intents for a given query and assume that these intents are unrelated to one another. However, studies have shown that modeling a hierarchy of intents has some benefits over the standard approach of using a flat list of intents. Intuitively, having more layers in the intent hierarchy seems to imply that we can consider more intricate relationships between intents and thereby identify subtle differences between documents that cover different intents. On the other hand, manually building a rich intent hierarchy imposes extra cost and is probably not very practical. In light of these considerations, we first propose a measure that builds a hierarchy of intents from a given set of flat intents by clustering per-intent relevant documents, thereby identifying subintents. Our second measure is a variant of the first that clusters per-topic relevant documents rather than per-intent ones and is therefore intent-free. Our third measure is a simple, completely intent-free measure for search result diversity evaluation that leverages document similarities. Our experiments based on the TREC Web Track 2009–2013 test collections show that our proposed measures have advantages over existing diversity measures despite their low annotation costs.
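The bottom-up idea behind the first measure, clustering each intent's relevant documents to obtain subintents, can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not name a clustering algorithm or a similarity function, so the greedy single-pass clustering, the bag-of-words cosine similarity, the `threshold` value, the function names, and the toy data below are all assumptions chosen for illustration.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(doc_ids, vectors, threshold=0.5):
    """Greedy single-pass clustering (an assumed stand-in for the paper's method).

    A document joins the first cluster whose seed (first member) is at least
    `threshold` similar to it; otherwise it starts a new cluster.
    """
    clusters = []
    for doc_id in doc_ids:
        for members in clusters:
            if cosine(vectors[doc_id], vectors[members[0]]) >= threshold:
                members.append(doc_id)
                break
        else:
            clusters.append([doc_id])
    return clusters


def build_hierarchy(relevant_docs, texts, threshold=0.5):
    """Map each flat intent to a list of subintents (clusters of document ids)."""
    vectors = {d: Counter(t.lower().split()) for d, t in texts.items()}
    return {
        intent: cluster(docs, vectors, threshold)
        for intent, docs in relevant_docs.items()
    }


# Hypothetical toy data: two flat intents for one ambiguous query.
texts = {
    "d1": "jaguar car price dealer",
    "d2": "jaguar car dealer review",
    "d3": "jaguar car engine repair manual",
    "d4": "jaguar animal habitat rainforest",
}
relevant_docs = {"cars": ["d1", "d2", "d3"], "animal": ["d4"]}

hierarchy = build_hierarchy(relevant_docs, texts)
print(hierarchy)
```

Passing a single pseudo-intent that holds all documents relevant to the topic would correspond to the per-topic variant described for the second measure, since no intent labels are then required.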

Original language: English
Pages (from-to): 86-113
Number of pages: 28
Journal: Information Retrieval Journal
Volume: 23
Issue number: 1
DOI
Publication status: Published - 1 Feb 2020

ASJC Scopus subject areas

  • Information Systems
  • Library and Information Sciences
