Retrieval of personal Web documents by extracting subjective expressions

Takahiro Hayashi, Koji Abe, Rikio Onai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents a method for gathering Japanese web documents which contain personal opinions. Our method is available as a pre-processing of applications for mining various opinions. In order to find personal documents on the Web, we focus on four kinds of subjective expressions: (1) negative meaning expressions, (2) final particles, (3) interjections, and (4) specific symbols such as face marks. Measuring the frequencies of these subjective expressions in a document, our method classifies web documents into personal and non-personal ones. Besides, our method gives the documents scores which show the accuracy of the classification results. We experimentally confirmed the effectiveness of the proposal using 1200 web documents. The experimental results have shown the precision and recall of the proposed classification are 0.70 and 0.87, respectively. In addition, we have confirmed that personal documents can be easily obtained by gathering documents which are given high scores.

Original languageEnglish
Title of host publicationProceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008
Pages1187-1192
Number of pages6
DOIs
Publication statusPublished - 2008
Externally publishedYes
Event22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008 - Gino-wan, Okinawa
Duration: 2008 Mar 252008 Mar 28

Other

Other22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008
CityGino-wan, Okinawa
Period08/3/2508/3/28

Fingerprint

Processing

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Hayashi, T., Abe, K., & Onai, R. (2008). Retrieval of personal Web documents by extracting subjective expressions. In Proceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008 (pp. 1187-1192). [4483080] https://doi.org/10.1109/WAINA.2008.52

Retrieval of personal Web documents by extracting subjective expressions. / Hayashi, Takahiro; Abe, Koji; Onai, Rikio.

Proceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008. 2008. p. 1187-1192 4483080.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hayashi, T, Abe, K & Onai, R 2008, Retrieval of personal Web documents by extracting subjective expressions. in Proceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008., 4483080, pp. 1187-1192, 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008, Gino-wan, Okinawa, 08/3/25. https://doi.org/10.1109/WAINA.2008.52
Hayashi T, Abe K, Onai R. Retrieval of personal Web documents by extracting subjective expressions. In Proceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008. 2008. p. 1187-1192. 4483080 https://doi.org/10.1109/WAINA.2008.52
Hayashi, Takahiro ; Abe, Koji ; Onai, Rikio. / Retrieval of personal Web documents by extracting subjective expressions. Proceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008. 2008. pp. 1187-1192
@inproceedings{4ab5bd3e92bb44019bee3e9249126d1d,
title = "Retrieval of personal Web documents by extracting subjective expressions",
abstract = "This paper presents a method for gathering Japanese web documents which contain personal opinions. Our method is available as a pre-processing of applications for mining various opinions. In order to find personal documents on the Web, we focus on four kinds of subjective expressions: (1) negative meaning expressions, (2) final particles, (3) interjections, and (4) specific symbols such as face marks. Measuring the frequencies of these subjective expressions in a document, our method classifies web documents into personal and non-personal ones. Besides, our method gives the documents scores which show the accuracy of the classification results. We experimentally confirmed the effectiveness of the proposal using 1200 web documents. The experimental results have shown the precision and recall of the proposed classification are 0.70 and 0.87, respectively. In addition, we have confirmed that personal documents can be easily obtained by gathering documents which are given high scores.",
author = "Takahiro Hayashi and Koji Abe and Rikio Onai",
year = "2008",
doi = "10.1109/WAINA.2008.52",
language = "English",
isbn = "0769530966",
pages = "1187--1192",
booktitle = "Proceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008",

}

TY - GEN

T1 - Retrieval of personal Web documents by extracting subjective expressions

AU - Hayashi, Takahiro

AU - Abe, Koji

AU - Onai, Rikio

PY - 2008

Y1 - 2008

N2 - This paper presents a method for gathering Japanese web documents which contain personal opinions. Our method is available as a pre-processing of applications for mining various opinions. In order to find personal documents on the Web, we focus on four kinds of subjective expressions: (1) negative meaning expressions, (2) final particles, (3) interjections, and (4) specific symbols such as face marks. Measuring the frequencies of these subjective expressions in a document, our method classifies web documents into personal and non-personal ones. Besides, our method gives the documents scores which show the accuracy of the classification results. We experimentally confirmed the effectiveness of the proposal using 1200 web documents. The experimental results have shown the precision and recall of the proposed classification are 0.70 and 0.87, respectively. In addition, we have confirmed that personal documents can be easily obtained by gathering documents which are given high scores.

AB - This paper presents a method for gathering Japanese web documents which contain personal opinions. Our method is available as a pre-processing of applications for mining various opinions. In order to find personal documents on the Web, we focus on four kinds of subjective expressions: (1) negative meaning expressions, (2) final particles, (3) interjections, and (4) specific symbols such as face marks. Measuring the frequencies of these subjective expressions in a document, our method classifies web documents into personal and non-personal ones. Besides, our method gives the documents scores which show the accuracy of the classification results. We experimentally confirmed the effectiveness of the proposal using 1200 web documents. The experimental results have shown the precision and recall of the proposed classification are 0.70 and 0.87, respectively. In addition, we have confirmed that personal documents can be easily obtained by gathering documents which are given high scores.

UR - http://www.scopus.com/inward/record.url?scp=50249098655&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=50249098655&partnerID=8YFLogxK

U2 - 10.1109/WAINA.2008.52

DO - 10.1109/WAINA.2008.52

M3 - Conference contribution

SN - 0769530966

SN - 9780769530963

SP - 1187

EP - 1192

BT - Proceedings - 22nd International Conference on Advanced Information Networking and Applications Workshops/Symposia, AINA 2008

ER -