Data mining method from text database

Masahiro Kawano, Junzo Watada*, Takayuki Kawaura

*Corresponding author for this work

    Research output: Conference contribution

    Abstract

    With the spread of multimedia technology, information processing is expected to handle increasingly varied types of data. In particular, fuzzy systems employ linguistic data as well as fuzzy numerical values. In this paper we propose a text mining method based on a fuzzy quantification model. The text mining process follows these steps: 1) Sentences in a Japanese text are broken down into words. 2) A fuzzy thesaurus, which translates words into synonyms or into higher-level concepts, is used to achieve a common understanding. In this paper, we treat runs of Chinese characters, or runs of more than one consecutive katakana letter (katakana being a Japanese syllabic script), as keywords. This method achieves high processing speed because it needs no dictionary for separating words. Fuzzy multivariate analysis is then applied to the processed data to extract the latent structure of mutual relations underlying the data. In other words, we extract knowledge from the given text data. Finally, we apply the method to mining text information from libraries and from Web pages distributed over a network, and discuss its application to Kansei engineering.
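
The dictionary-free keyword step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Unicode ranges, the choice to accept kanji runs of any length, and the names `KEYWORD_PATTERN`, `extract_keywords`, and `keyword_counts` are assumptions made for illustration only. The fuzzy thesaurus and the fuzzy multivariate analysis are not covered.

```python
import re
from collections import Counter
from typing import Iterable, List

# Assumptions (not from the paper):
# - "Chinese characters" means the CJK Unified Ideographs block U+4E00..U+9FFF.
# - "Katakana letters" means U+30A1..U+30FA plus the long-vowel mark U+30FC.
# - Kanji runs of any length count; katakana runs need at least two letters.
KEYWORD_PATTERN = re.compile(r"[\u4E00-\u9FFF]+|[\u30A1-\u30FA\u30FC]{2,}")


def extract_keywords(text: str) -> List[str]:
    """Return kanji runs and multi-letter katakana runs, in order of appearance."""
    return KEYWORD_PATTERN.findall(text)


def keyword_counts(texts: Iterable[str]) -> Counter:
    """Count keyword occurrences over a collection of texts."""
    counts: Counter = Counter()
    for text in texts:
        counts.update(extract_keywords(text))
    return counts


if __name__ == "__main__":
    sample = "ファジィ理論によるテキストマイニングの手法"
    # Hiragana particles act as separators, so no word dictionary is needed.
    print(extract_keywords(sample))
```

Because the pattern relies only on character classes, hiragana and punctuation serve as implicit word boundaries, which is one plausible reading of how processing without a segmentation dictionary could work.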

    Original language: English
    Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Pages: 1122-1128
    Number of pages: 7
    Volume: 3683 LNAI
    Publication status: Published - 2005
    Event: 9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005 - Melbourne
    Duration: 14 Sep 2005 to 16 Sep 2005

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 3683 LNAI
    ISSN (Print): 0302-9743
    ISSN (Electronic): 1611-3349

    Other

    Other: 9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005
    City: Melbourne
    Period: 05/9/14 to 05/9/16

    ASJC Scopus subject areas

    • Computer Science (all)
    • Biochemistry, Genetics and Molecular Biology (all)
    • Theoretical Computer Science
