Hit count reliability: How much can we trust hit counts?

Koh Satoh*, Hayato Yamana

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Citations (Scopus)

Abstract

Recently, numerous studies have relied on the number of search results, i.e., the hit count. However, hit counts returned by search engines can vary unnaturally when observed on different days, and may contain large errors that affect research depending on those results. Such errors can result in low precision in machine translation, incorrect extraction of synonyms, and other problems. Thus, it is indispensable to evaluate and improve the reliability of hit counts. Several studies have demonstrated this phenomenon; however, none of the previous studies has made clear how much we can trust hit counts. In this paper, we propose hit count reliability metrics that quantitatively evaluate the reliability of hit counts in order to improve hit count selection. The evaluation results with Google show that our metrics successfully adopt reliable hit counts with 99.8% precision and skip unreliable hit counts with 74.3% precision.

Original language: English
Title of host publication: Web Technologies and Applications - 14th Asia-Pacific Web Conference, APWeb 2012, Proceedings
Pages: 751-758
Number of pages: 8
Publication status: Published - 2012 Apr 18
Event: 14th Asia Pacific Web Technology Conference, APWeb 2012 - Kunming, China
Duration: 2012 Apr 11 to 2012 Apr 13

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7235 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 14th Asia Pacific Web Technology Conference, APWeb 2012
Country/Territory: China
City: Kunming
Period: 12/4/11 to 12/4/13

Keywords

  • Hit Count
  • Information Retrieval
  • Reliability
  • Search Engine

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)
