PlusDBG

Web community extraction scheme improving both precision and pseudo-recall

Naoyuki Saida, Akira Umezawa, Hayato Yamana

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    This paper proposes PlusDBG to improve both precision and pseudo-recall by extending the conventional Web community extraction scheme. Precision is defined as the percentage of relevant Web pages extracted as members of Web communities and pseudo-recall is defined as the sum of the number of relevant Web pages extracted as members of Web communities. The proposed scheme adopts the new distance parameter defined by the relevance between a Web page and a Web community, and extracts the Web community with higher precision and pseudo-recall. Moreover, we have implemented and evaluated the proposed scheme. Our results confirm that the proposed scheme is able to extract about 3.2-fold larger numbers of members of Web communities than the conventional scheme, while maintaining equivalent precision.

    Original languageEnglish
    Title of host publicationLecture Notes in Computer Science
    EditorsY. Zhang, K. Tanaka, J.X. Yu, S. Wang, M. Li
    Pages938-943
    Number of pages6
    Volume3399
    Publication statusPublished - 2005
    Event7th Asia-Pacific Web Conference on Web Technologies Research and Development - APWeb 2005 - Shanghai, China
    Duration: 2005 Mar 292005 Apr 1

    Other

    Other7th Asia-Pacific Web Conference on Web Technologies Research and Development - APWeb 2005
    CountryChina
    CityShanghai
    Period05/3/2905/4/1

    Fingerprint

    World Wide Web
    Websites

    ASJC Scopus subject areas

    • Computer Science (miscellaneous)

    Cite this

    Saida, N., Umezawa, A., & Yamana, H. (2005). PlusDBG: Web community extraction scheme improving both precision and pseudo-recall. In Y. Zhang, K. Tanaka, J. X. Yu, S. Wang, & M. Li (Eds.), Lecture Notes in Computer Science (Vol. 3399, pp. 938-943)

    PlusDBG : Web community extraction scheme improving both precision and pseudo-recall. / Saida, Naoyuki; Umezawa, Akira; Yamana, Hayato.

    Lecture Notes in Computer Science. ed. / Y. Zhang; K. Tanaka; J.X. Yu; S. Wang; M. Li. Vol. 3399 2005. p. 938-943.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Saida, N, Umezawa, A & Yamana, H 2005, PlusDBG: Web community extraction scheme improving both precision and pseudo-recall. in Y Zhang, K Tanaka, JX Yu, S Wang & M Li (eds), Lecture Notes in Computer Science. vol. 3399, pp. 938-943, 7th Asia-Pacific Web Conference on Web Technologies Research and Development - APWeb 2005, Shanghai, China, 05/3/29.
    Saida N, Umezawa A, Yamana H. PlusDBG: Web community extraction scheme improving both precision and pseudo-recall. In Zhang Y, Tanaka K, Yu JX, Wang S, Li M, editors, Lecture Notes in Computer Science. Vol. 3399. 2005. p. 938-943
    Saida, Naoyuki ; Umezawa, Akira ; Yamana, Hayato. / PlusDBG : Web community extraction scheme improving both precision and pseudo-recall. Lecture Notes in Computer Science. editor / Y. Zhang ; K. Tanaka ; J.X. Yu ; S. Wang ; M. Li. Vol. 3399 2005. pp. 938-943
    @inproceedings{5c15de4ce13e47dd93f759b899121830,
    title = "PlusDBG: Web community extraction scheme improving both precision and pseudo-recall",
    abstract = "This paper proposes PlusDBG to improve both precision and pseudo-recall by extending the conventional Web community extraction scheme. Precision is defined as the percentage of relevant Web pages extracted as members of Web communities and pseudo-recall is defined as the sum of the number of relevant Web pages extracted as members of Web communities. The proposed scheme adopts the new distance parameter defined by the relevance between a Web page and a Web community, and extracts the Web community with higher precision and pseudo-recall. Moreover, we have implemented and evaluated the proposed scheme. Our results confirm that the proposed scheme is able to extract about 3.2-fold larger numbers of members of Web communities than the conventional scheme, while maintaining equivalent precision.",
    author = "Naoyuki Saida and Akira Umezawa and Hayato Yamana",
    year = "2005",
    language = "English",
    volume = "3399",
    pages = "938--943",
    editor = "Y. Zhang and K. Tanaka and J.X. Yu and S. Wang and M. Li",
    booktitle = "Lecture Notes in Computer Science",

    }

    TY - GEN

    T1 - PlusDBG

    T2 - Web community extraction scheme improving both precision and pseudo-recall

    AU - Saida, Naoyuki

    AU - Umezawa, Akira

    AU - Yamana, Hayato

    PY - 2005

    Y1 - 2005

    N2 - This paper proposes PlusDBG to improve both precision and pseudo-recall by extending the conventional Web community extraction scheme. Precision is defined as the percentage of relevant Web pages extracted as members of Web communities and pseudo-recall is defined as the sum of the number of relevant Web pages extracted as members of Web communities. The proposed scheme adopts the new distance parameter defined by the relevance between a Web page and a Web community, and extracts the Web community with higher precision and pseudo-recall. Moreover, we have implemented and evaluated the proposed scheme. Our results confirm that the proposed scheme is able to extract about 3.2-fold larger numbers of members of Web communities than the conventional scheme, while maintaining equivalent precision.

    AB - This paper proposes PlusDBG to improve both precision and pseudo-recall by extending the conventional Web community extraction scheme. Precision is defined as the percentage of relevant Web pages extracted as members of Web communities and pseudo-recall is defined as the sum of the number of relevant Web pages extracted as members of Web communities. The proposed scheme adopts the new distance parameter defined by the relevance between a Web page and a Web community, and extracts the Web community with higher precision and pseudo-recall. Moreover, we have implemented and evaluated the proposed scheme. Our results confirm that the proposed scheme is able to extract about 3.2-fold larger numbers of members of Web communities than the conventional scheme, while maintaining equivalent precision.

    UR - http://www.scopus.com/inward/record.url?scp=24144469342&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=24144469342&partnerID=8YFLogxK

    M3 - Conference contribution

    VL - 3399

    SP - 938

    EP - 943

    BT - Lecture Notes in Computer Science

    A2 - Zhang, Y.

    A2 - Tanaka, K.

    A2 - Yu, J.X.

    A2 - Wang, S.

    A2 - Li, M.

    ER -