Data mining method from text database

Masahiro Kawano, Junzo Watada, Takayuki Kawaura

    Research output: Conference contribution

    Abstract

    Recently, multimedia technology has made many types of data available for information processing. In particular, fuzzy systems now employ linguistic data as well as fuzzy numerical values. In this paper we propose a text mining method based on a fuzzy quantification model. The text mining process follows these steps: 1) Sentences in a Japanese text are broken down into words. 2) A fuzzy thesaurus, which translates words into synonyms or into upper-level concepts, is used to realize a common understanding. In this paper, we take as keywords the words written in Chinese characters (kanji) or in runs of more than one consecutive katakana letter (katakana being a Japanese phonetic script). This method achieves high processing speed because it needs no dictionary for separating words. Fuzzy multivariate analysis is then applied to the processed data to extract the latent structure of mutual relations underlying the data. In other words, we extract knowledge from the given text data. Finally, we apply the method to mining text information from libraries and from Web pages distributed over a network, and discuss its application to Kansei engineering.
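The dictionary-free keyword step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `extract_keywords` and `keyword_counts`, the Unicode ranges used for kanji and katakana, and the choice to accept kanji runs of any length are all assumptions made for illustration. The fuzzy thesaurus and the fuzzy multivariate analysis are not covered.

```python
import re
from collections import Counter

# Assumed character classes (not specified in the abstract):
#   kanji    -> CJK Unified Ideographs, U+4E00..U+9FFF, runs of any length
#   katakana -> U+30A1..U+30FA plus the prolonged sound mark U+30FC,
#               runs of at least two characters ("more than one katakana letter")
KEYWORD_PATTERN = re.compile(r"[\u4E00-\u9FFF]+|[\u30A1-\u30FA\u30FC]{2,}")


def extract_keywords(text: str) -> list[str]:
    """Return kanji runs and multi-letter katakana runs, in order of appearance."""
    return KEYWORD_PATTERN.findall(text)


def keyword_counts(documents: list[str]) -> list[Counter]:
    """Count keyword occurrences per document, as raw input for later analysis."""
    return [Counter(extract_keywords(doc)) for doc in documents]


if __name__ == "__main__":
    sample = "ファジィ理論を用いたテキストマイニングの方法"
    print(extract_keywords(sample))
```

Because only character classes are matched, hiragana particles and inflectional endings act as natural separators, which is why no word-segmentation dictionary is needed in this sketch. A kanji run directly adjacent to a katakana run is returned as two separate keywords.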

    Original language: English
    Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Pages: 1122-1128
    Number of pages: 7
    Volume: 3683 LNAI
    Publication status: Published - 2005
    Event: 9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005 - Melbourne
    Duration: 2005-09-14 to 2005-09-16

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 3683 LNAI
    ISSN (Print): 0302-9743
    ISSN (Electronic): 1611-3349

    Other

    Other: 9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005
    City: Melbourne
    Period: 05/9/14 to 05/9/16

    ASJC Scopus subject areas

    • Computer Science(all)
    • Biochemistry, Genetics and Molecular Biology(all)
    • Theoretical Computer Science
