The effect of score standardisation on topic set size design

Tetsuya Sakai*

*この研究の対応する著者

研究成果

2 被引用数 (Scopus)

抄録

Given a topic-by-run score matrix from past data, topic set size design methods can help test collection builders determine the number of topics to create for a new test collection from a statistical viewpoint. In this study, we apply a recently-proposed score standardisation method called std-AB to score matrices before applying topic set size design, and demonstrate its advantages. For topic set size design, std-AB suppresses score variances and thereby enables test collection builders to consider realistic choices of topic set sizes, and to handle unnormalised measures in the same way as normalised measures. In addition, even discrete measures that clearly violate normality assumptions look more continuous after applying std-AB, which may make them more suitable for statistically motivated topic set size design. Our experiments cover a variety of tasks and evaluation measures from NTCIR-12.

本文言語English
ホスト出版物のタイトルInformation Retrieval Technology - 12th Asia Information Retrieval Societies Conference, AIRS 2016, Proceedings
編集者Yi Chang, Ji-Rong Wen, Zhicheng Dou, Xin Zhao, Shaoping Ma, Yiqun Liu, Min Zhang
出版社Springer Verlag
ページ16-28
ページ数13
ISBN(印刷版)9783319480503
DOI
出版ステータスPublished - 2016
イベント12th Asia Information Retrieval Societies Conference, AIRS 2016 - Beijing, China
継続期間: 2016 11 302016 12 2

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
9994 LNCS
ISSN(印刷版)0302-9743
ISSN(電子版)1611-3349

Other

Other12th Asia Information Retrieval Societies Conference, AIRS 2016
国/地域China
CityBeijing
Period16/11/3016/12/2

ASJC Scopus subject areas

  • 理論的コンピュータサイエンス
  • コンピュータ サイエンス(全般)

フィンガープリント

「The effect of score standardisation on topic set size design」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル