Using graded-relevance metrics for evaluating community QA answer selection

Tetsuya Sakai*, Yohei Seki, Daisuke Ishikawa, Kazuko Kuriyama, Noriko Kando, Chin Yew Lin

*この研究の対応する著者

研究成果: Conference contribution

29 被引用数 (Scopus)

抄録

Community Question Answering (CQA) sites such as Yahoo ! Answers have emerged as rich knowledge resources for information seekers. However, answers posted to CQA sites can be irrelevant, incomplete, redundant, incorrect, biased, ill-formed or even abusive. Hence, automatic selection of "good" answers for a given posted question is a practical research problem that will help us manage the quality of accumulated knowledge. One way to evaluate answer selection systems for CQA would be to use the Best Answers (BAs) that are readily available from the CQA sites. However, BAs may be biased, and even if they are not, there may be other good answers besides BAs. To remedy these two problems, we propose system evaluation methods that involve multiple answer assessors and graded-relevance information retrieval metrics. Our main findings from experiments using the NTCIR-8 CQA task data are that, using our evaluation methods, (a) we can detect many substantial differences between systems that would have been overlooked by BA-based evaluation; and (b) we can better identify hard questions (i.e. those that are handled poorly by many systems and therefore require focussed investigation) compared to BAbased evaluation. We therefore argue that our approach is useful for building effective CQA answer selection systems despite the cost of manual answer assessments.

本文言語English
ホスト出版物のタイトルProceedings of the 4th ACM International Conference on Web Search and Data Mining, WSDM 2011
ページ187-196
ページ数10
DOI
出版ステータスPublished - 2011
外部発表はい
イベント4th ACM International Conference on Web Search and Data Mining, WSDM 2011 - Hong Kong, China
継続期間: 2011 2 92011 2 12

出版物シリーズ

名前Proceedings of the 4th ACM International Conference on Web Search and Data Mining, WSDM 2011

Conference

Conference4th ACM International Conference on Web Search and Data Mining, WSDM 2011
国/地域China
CityHong Kong
Period11/2/911/2/12

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信
  • コンピュータ サイエンスの応用
  • ソフトウェア

フィンガープリント

「Using graded-relevance metrics for evaluating community QA answer selection」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル