POLYPHONET: An advanced social network extraction system from the Web

Yutaka Matsuo, Junichiro Mori, Masahiro Hamasaki, Takuichi Nishimura, Hideaki Takeda, Koiti Hasida, Mitsuru Ishizuka

Research output: Contribution to journalArticle

87 Citations (Scopus)

Abstract

Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction system called POLYPHONET, which employs several advanced techniques to extract relations of persons, to detect groups of persons, and to obtain keywords for a person. Search engines, especially Google, are used to measure co-occurrence of information and obtain Web documents. Several studies have used search engines to extract social networks from the Web, but our research advances the following points: first, we reduce the related methods into simple pseudocodes using Google so that we can build up integrated systems. Second, we develop several new algorithms for social network mining such as those to classify relations into categories, to make extraction scalable, and to obtain and utilize person-to-word relations. Third, every module is implemented in POLYPHONET, which has been used at four academic conferences, each with more than 500 participants. We overview that system. Finally, a novel architecture called Iterative Social Network Mining is proposed. It utilizes simple modules using Google and is characterized by scalability and relate-identify processes: identification of each entity and extraction of relations are repeated to obtain a more precise social network.

Original languageEnglish
Pages (from-to)262-278
Number of pages17
JournalWeb Semantics
Volume5
Issue number4
DOIs
Publication statusPublished - 2007 Dec
Externally publishedYes

Fingerprint

Search engines
World Wide Web
Ubiquitous computing
Knowledge management
Semantic Web
Information retrieval
Scalability
Identification (control systems)

Keywords

  • Search engine
  • Social network
  • Web mining

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Networks and Communications

Cite this

Matsuo, Y., Mori, J., Hamasaki, M., Nishimura, T., Takeda, H., Hasida, K., & Ishizuka, M. (2007). POLYPHONET: An advanced social network extraction system from the Web. Web Semantics, 5(4), 262-278. https://doi.org/10.1016/j.websem.2007.09.002

POLYPHONET : An advanced social network extraction system from the Web. / Matsuo, Yutaka; Mori, Junichiro; Hamasaki, Masahiro; Nishimura, Takuichi; Takeda, Hideaki; Hasida, Koiti; Ishizuka, Mitsuru.

In: Web Semantics, Vol. 5, No. 4, 12.2007, p. 262-278.

Research output: Contribution to journalArticle

Matsuo, Y, Mori, J, Hamasaki, M, Nishimura, T, Takeda, H, Hasida, K & Ishizuka, M 2007, 'POLYPHONET: An advanced social network extraction system from the Web', Web Semantics, vol. 5, no. 4, pp. 262-278. https://doi.org/10.1016/j.websem.2007.09.002
Matsuo Y, Mori J, Hamasaki M, Nishimura T, Takeda H, Hasida K et al. POLYPHONET: An advanced social network extraction system from the Web. Web Semantics. 2007 Dec;5(4):262-278. https://doi.org/10.1016/j.websem.2007.09.002
Matsuo, Yutaka ; Mori, Junichiro ; Hamasaki, Masahiro ; Nishimura, Takuichi ; Takeda, Hideaki ; Hasida, Koiti ; Ishizuka, Mitsuru. / POLYPHONET : An advanced social network extraction system from the Web. In: Web Semantics. 2007 ; Vol. 5, No. 4. pp. 262-278.
@article{9d8a37df60cc4445b7cb48bb1ebf6909,
title = "POLYPHONET: An advanced social network extraction system from the Web",
abstract = "Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction system called POLYPHONET, which employs several advanced techniques to extract relations of persons, to detect groups of persons, and to obtain keywords for a person. Search engines, especially Google, are used to measure co-occurrence of information and obtain Web documents. Several studies have used search engines to extract social networks from the Web, but our research advances the following points: first, we reduce the related methods into simple pseudocodes using Google so that we can build up integrated systems. Second, we develop several new algorithms for social network mining such as those to classify relations into categories, to make extraction scalable, and to obtain and utilize person-to-word relations. Third, every module is implemented in POLYPHONET, which has been used at four academic conferences, each with more than 500 participants. We overview that system. Finally, a novel architecture called Iterative Social Network Mining is proposed. It utilizes simple modules using Google and is characterized by scalability and relate-identify processes: identification of each entity and extraction of relations are repeated to obtain a more precise social network.",
keywords = "Search engine, Social network, Web mining",
author = "Yutaka Matsuo and Junichiro Mori and Masahiro Hamasaki and Takuichi Nishimura and Hideaki Takeda and Koiti Hasida and Mitsuru Ishizuka",
year = "2007",
month = "12",
doi = "10.1016/j.websem.2007.09.002",
language = "English",
volume = "5",
pages = "262--278",
journal = "Web Semantics",
issn = "1570-8268",
publisher = "Elsevier",
number = "4",

}

TY - JOUR

T1 - POLYPHONET

T2 - An advanced social network extraction system from the Web

AU - Matsuo, Yutaka

AU - Mori, Junichiro

AU - Hamasaki, Masahiro

AU - Nishimura, Takuichi

AU - Takeda, Hideaki

AU - Hasida, Koiti

AU - Ishizuka, Mitsuru

PY - 2007/12

Y1 - 2007/12

N2 - Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction system called POLYPHONET, which employs several advanced techniques to extract relations of persons, to detect groups of persons, and to obtain keywords for a person. Search engines, especially Google, are used to measure co-occurrence of information and obtain Web documents. Several studies have used search engines to extract social networks from the Web, but our research advances the following points: first, we reduce the related methods into simple pseudocodes using Google so that we can build up integrated systems. Second, we develop several new algorithms for social network mining such as those to classify relations into categories, to make extraction scalable, and to obtain and utilize person-to-word relations. Third, every module is implemented in POLYPHONET, which has been used at four academic conferences, each with more than 500 participants. We overview that system. Finally, a novel architecture called Iterative Social Network Mining is proposed. It utilizes simple modules using Google and is characterized by scalability and relate-identify processes: identification of each entity and extraction of relations are repeated to obtain a more precise social network.

AB - Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction system called POLYPHONET, which employs several advanced techniques to extract relations of persons, to detect groups of persons, and to obtain keywords for a person. Search engines, especially Google, are used to measure co-occurrence of information and obtain Web documents. Several studies have used search engines to extract social networks from the Web, but our research advances the following points: first, we reduce the related methods into simple pseudocodes using Google so that we can build up integrated systems. Second, we develop several new algorithms for social network mining such as those to classify relations into categories, to make extraction scalable, and to obtain and utilize person-to-word relations. Third, every module is implemented in POLYPHONET, which has been used at four academic conferences, each with more than 500 participants. We overview that system. Finally, a novel architecture called Iterative Social Network Mining is proposed. It utilizes simple modules using Google and is characterized by scalability and relate-identify processes: identification of each entity and extraction of relations are repeated to obtain a more precise social network.

KW - Search engine

KW - Social network

KW - Web mining

UR - http://www.scopus.com/inward/record.url?scp=36549074243&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=36549074243&partnerID=8YFLogxK

U2 - 10.1016/j.websem.2007.09.002

DO - 10.1016/j.websem.2007.09.002

M3 - Article

AN - SCOPUS:36549074243

VL - 5

SP - 262

EP - 278

JO - Web Semantics

JF - Web Semantics

SN - 1570-8268

IS - 4

ER -