Building a terabyte-scale web data collection "NW1000G-04" in the NTCIR-5 WEB task

Masao Takaku, Keizo Oyama, Akiko Aizawa, Haruko Ishikawa, Kengo Minamide, Shin Kato, Hayato Yamana, Junya Hayashi

Research output: Contribution to journalArticle

Abstract

We built a terabyte-scale web data collection, NW1000G-04, which was used in the NTCIR-5 WEB task. This report describes the process of building the collection and some statistics of it in detail.

Original languageEnglish
Pages (from-to)1-8
Number of pages8
JournalNII Technical Reports
Volume2006
Issue number12
Publication statusPublished - 2006 Sep 7

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications

Fingerprint Dive into the research topics of 'Building a terabyte-scale web data collection "NW1000G-04" in the NTCIR-5 WEB task'. Together they form a unique fingerprint.

  • Cite this

    Takaku, M., Oyama, K., Aizawa, A., Ishikawa, H., Minamide, K., Kato, S., Yamana, H., & Hayashi, J. (2006). Building a terabyte-scale web data collection "NW1000G-04" in the NTCIR-5 WEB task. NII Technical Reports, 2006(12), 1-8.