PlusDBG: Web community extraction scheme improving both precision and pseudo-recall

Naoyuki Saida, Akira Umezawa, Hayato Yamana

Research output: Contribution to journalConference article

Abstract

This paper proposes PlusDBG to improve both precision and pseudo-recall by extending the conventional Web community extraction scheme. Precision is defined as the percentage of relevant Web pages extracted as members of Web communities and pseudo-recall is defined as the sum of the number of relevant Web pages extracted as members of Web communities. The proposed scheme adopts the new distance parameter defined by the relevance between a Web page and a Web community, and extracts the Web community with higher precision and pseudo-recall. Moreover, we have implemented and evaluated the proposed scheme. Our results confirm that the proposed scheme is able to extract about 3.2-fold larger numbers of members of Web communities than the conventional scheme, while maintaining equivalent precision.

Original languageEnglish
Pages (from-to)938-943
Number of pages6
JournalLECTURE NOTES IN COMPUTER SCIENCE
Volume3399
DOIs
Publication statusPublished - 2005 Jan 1
Event7th Asia-Pacific Web Conference on Web Technologies Research and Development - APWeb 2005 - Shanghai, China
Duration: 2005 Mar 292005 Apr 1

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'PlusDBG: Web community extraction scheme improving both precision and pseudo-recall'. Together they form a unique fingerprint.

Cite this