Abstract
This paper proposes PlusDBG, a scheme that improves both precision and pseudo-recall by extending the conventional Web community extraction scheme. Precision is defined as the percentage of relevant Web pages extracted as members of Web communities, and pseudo-recall is defined as the total number of relevant Web pages extracted as members of Web communities. The proposed scheme adopts a new distance parameter, defined by the relevance between a Web page and a Web community, and extracts Web communities with higher precision and pseudo-recall. We have implemented and evaluated the proposed scheme. Our results confirm that it extracts about 3.2 times as many Web community members as the conventional scheme while maintaining equivalent precision.
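The two evaluation measures defined in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function names, the representation of communities as sets of page identifiers, and the toy data are all hypothetical. It also assumes that precision is read as the share of extracted members that are relevant, and that pseudo-recall counts relevant members per community and sums the counts.

```python
def precision(communities, relevant):
    """Percentage of extracted community members that are relevant.

    Assumed reading of the abstract's definition; not the authors' code.
    """
    extracted = [page for members in communities for page in members]
    if not extracted:
        return 0.0
    hits = sum(1 for page in extracted if page in relevant)
    return 100.0 * hits / len(extracted)


def pseudo_recall(communities, relevant):
    """Total number of relevant pages extracted, summed over communities."""
    return sum(1 for members in communities for page in members if page in relevant)


# Hypothetical toy data: two extracted communities and a relevance judgment.
communities = [{"a", "b", "c"}, {"d", "e"}]
relevant = {"a", "b", "d", "x"}

print(precision(communities, relevant))      # 3 of 5 extracted pages are relevant
print(pseudo_recall(communities, relevant))  # 3 relevant pages extracted in total
```

Under this reading, a scheme can raise pseudo-recall by extracting more relevant members while precision stays flat, which is the kind of outcome the abstract reports.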
Original language | English |
---|---|
Pages (from-to) | 938-943 |
Number of pages | 6 |
Journal | LECTURE NOTES IN COMPUTER SCIENCE |
Volume | 3399 |
Publication status | Published - 2005 Jan 1 |
Event | 7th Asia-Pacific Web Conference on Web Technologies Research and Development - APWeb 2005 - Shanghai, China. Duration: 2005 Mar 29 → 2005 Apr 1 |
ASJC Scopus subject areas
- Theoretical Computer Science
- Computer Science (all)