Building a terabyte-scale web data collection "NW1000G-04" in the NTCIR-5 WEB task

Masao Takaku*, Keizo Oyama, Akiko Aizawa, Haruko Ishikawa, Kengo Minamide, Shin Kato, Hayato Yamana, Junya Hayashi

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

We built a terabyte-scale web data collection, NW1000G-04, which was used in the NTCIR-5 WEB task. This report describes the process of building the collection and some statistics of it in detail.

Original languageEnglish
Pages (from-to)1-8
Number of pages8
JournalNII Technical Reports
Volume2006
Issue number12
Publication statusPublished - 2006 Sep 7

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Building a terabyte-scale web data collection "NW1000G-04" in the NTCIR-5 WEB task'. Together they form a unique fingerprint.

Cite this