TSUBAKI: An open search engine infrastructure for developing information access methodology

Keiji Shinzato*, Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

Due to the explosive growth in the amount of information in the last decade, it is getting extremely harder to obtain necessary information by conventional information access methods. Hence, creation of drastically new technology is needed. For developing such new technology, search engine infrastructures are required. Although the existing search engine APIs can be regarded as such infrastructures, these APIs have several restrictions such as a limit on the number of API calls. To help the development of new technology, we are running an open search engine infrastructure, TSUBAKI, on a high-performance computing environment. In this paper, we describe TSUBAKI infrastructure.

Original languageEnglish
Pages (from-to)216-227
Number of pages12
JournalJournal of information processing
Volume20
Issue number1
DOIs
Publication statusPublished - 2012
Externally publishedYes

Keywords

  • Info-plosion
  • Natural language processing
  • Natural language search
  • Search engine infrastructure
  • Web-based data sharing

ASJC Scopus subject areas

  • Computer Science(all)

Fingerprint

Dive into the research topics of 'TSUBAKI: An open search engine infrastructure for developing information access methodology'. Together they form a unique fingerprint.

Cite this