This paper proposes PlusDBG, which extends the conventional Web community extraction scheme to improve both precision and pseudo-recall. Precision is defined as the percentage of pages extracted as members of Web communities that are relevant, and pseudo-recall is defined as the total number of relevant Web pages extracted as members of Web communities. The proposed scheme adopts a new distance parameter, defined by the relevance between a Web page and a Web community, and uses it to extract Web communities with higher precision and pseudo-recall. We have implemented and evaluated the proposed scheme. Our results confirm that it extracts about 3.2 times as many members of Web communities as the conventional scheme while maintaining equivalent precision.
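The two evaluation measures defined in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `evaluate_communities`, the representation of a community as a set of page identifiers, and the assumption that a set of relevant pages is already available (for example from manual judgment) are all hypothetical choices made for illustration.

```python
from typing import Hashable, Iterable, Set, Tuple


def evaluate_communities(
    communities: Iterable[Iterable[Hashable]],
    relevant: Set[Hashable],
) -> Tuple[float, int]:
    """Return (precision in percent, pseudo-recall) for extracted communities.

    Assumptions (not from the paper): each community is a collection of
    page identifiers, and `relevant` holds the pages judged relevant.
    """
    extracted = 0           # total number of extracted members
    relevant_extracted = 0  # extracted members that are relevant

    for members in communities:
        member_set = set(members)
        extracted += len(member_set)
        relevant_extracted += len(member_set & relevant)

    # Precision: share of extracted members that are relevant, in percent.
    precision = 100.0 * relevant_extracted / extracted if extracted else 0.0
    # Pseudo-recall: an absolute count of relevant extracted pages,
    # summed over communities, rather than a ratio.
    return precision, relevant_extracted


# Hypothetical example data.
communities = [{"a", "b", "c"}, {"d", "e"}]
relevant = {"a", "b", "d", "x"}
precision, pseudo_recall = evaluate_communities(communities, relevant)
```

Because pseudo-recall is a count rather than a ratio, it does not require knowing every relevant page on the Web, which is presumably why the abstract reports a fold increase in extracted members at equivalent precision.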
|Journal||LECTURE NOTES IN COMPUTER SCIENCE|
|Publication status||Published - 1 Jan 2005|
|Event||7th Asia-Pacific Web Conference on Web Technologies Research and Development - APWeb 2005 - Shanghai, China|
Duration: 29 Mar 2005 → 1 Apr 2005
ASJC Scopus subject areas
- General Computer Science