Data mining method from text database

Masahiro Kawano, Junzo Watada, Takayuki Kawaura

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Recently, various types of data are expected to get in information processing according to multi-media technology. Especially, linguistic data are employed in fuzzy systems as well as fuzzy numerical values. In this paper we propose a text minig method based on fuzzy quantification model. In the process of text mining, we will pursue the following steps: 1) Sentences included in a text in Japanese are broken down into words. 2) It is possible to realize common understanding using fuzzy thesaurus that enables us to translate words into synonyms or into upper concepts. In this paper, we employ the method to translate words using Chinese characters or continuous letters of Katakana more then one katakana letter (Japanese alphabet letter) into keywords. The method realizes the high speed of processing without any dictionary for separating words. Fuzzy multivariate analysis is employed to analyze such processed data and to abstract a latent mutual related structure under the data. In other words, we abstract the knowledge from the given text data. At the end we apply the method to mining the text information of libraries and Web pages distributed over a web network and discussing about the application to Kansei engineering.

    Original languageEnglish
    Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Pages1122-1128
    Number of pages7
    Volume3683 LNAI
    Publication statusPublished - 2005
    Event9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005 - Melbourne
    Duration: 2005 Sep 142005 Sep 16

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume3683 LNAI
    ISSN (Print)03029743
    ISSN (Electronic)16113349

    Other

    Other9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005
    CityMelbourne
    Period05/9/1405/9/16

    Fingerprint

    Thesauri
    Data Mining
    Fuzzy systems
    Glossaries
    Linguistics
    World Wide Web
    Data mining
    Websites
    Databases
    Processing
    Controlled Vocabulary
    Automatic Data Processing
    Thesaurus
    Libraries
    Multivariate Analysis
    Text Mining
    Information Processing
    Fuzzy Systems
    Quantification
    Multimedia

    Keywords

    • Fuzzy quantification analysis
    • Library data
    • Text mining

    ASJC Scopus subject areas

    • Computer Science(all)
    • Biochemistry, Genetics and Molecular Biology(all)
    • Theoretical Computer Science

    Cite this

    Kawano, M., Watada, J., & Kawaura, T. (2005). Data mining method from text database. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3683 LNAI, pp. 1122-1128). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3683 LNAI).

    Data mining method from text database. / Kawano, Masahiro; Watada, Junzo; Kawaura, Takayuki.

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 3683 LNAI 2005. p. 1122-1128 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3683 LNAI).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Kawano, M, Watada, J & Kawaura, T 2005, Data mining method from text database. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 3683 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 3683 LNAI, pp. 1122-1128, 9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005, Melbourne, 05/9/14.
    Kawano M, Watada J, Kawaura T. Data mining method from text database. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 3683 LNAI. 2005. p. 1122-1128. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
    Kawano, Masahiro ; Watada, Junzo ; Kawaura, Takayuki. / Data mining method from text database. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 3683 LNAI 2005. pp. 1122-1128 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
    @inproceedings{b1b4ccbdfee547c5b1b3009f47c63b3b,
    title = "Data mining method from text database",
    abstract = "Recently, various types of data are expected to get in information processing according to multi-media technology. Especially, linguistic data are employed in fuzzy systems as well as fuzzy numerical values. In this paper we propose a text minig method based on fuzzy quantification model. In the process of text mining, we will pursue the following steps: 1) Sentences included in a text in Japanese are broken down into words. 2) It is possible to realize common understanding using fuzzy thesaurus that enables us to translate words into synonyms or into upper concepts. In this paper, we employ the method to translate words using Chinese characters or continuous letters of Katakana more then one katakana letter (Japanese alphabet letter) into keywords. The method realizes the high speed of processing without any dictionary for separating words. Fuzzy multivariate analysis is employed to analyze such processed data and to abstract a latent mutual related structure under the data. In other words, we abstract the knowledge from the given text data. At the end we apply the method to mining the text information of libraries and Web pages distributed over a web network and discussing about the application to Kansei engineering.",
    keywords = "Fuzzy quantification analysis, Library data, Text mining",
    author = "Masahiro Kawano and Junzo Watada and Takayuki Kawaura",
    year = "2005",
    language = "English",
    isbn = "3540288961",
    volume = "3683 LNAI",
    series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
    pages = "1122--1128",
    booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

    }

    TY - GEN

    T1 - Data mining method from text database

    AU - Kawano, Masahiro

    AU - Watada, Junzo

    AU - Kawaura, Takayuki

    PY - 2005

    Y1 - 2005

    N2 - Recently, various types of data are expected to get in information processing according to multi-media technology. Especially, linguistic data are employed in fuzzy systems as well as fuzzy numerical values. In this paper we propose a text minig method based on fuzzy quantification model. In the process of text mining, we will pursue the following steps: 1) Sentences included in a text in Japanese are broken down into words. 2) It is possible to realize common understanding using fuzzy thesaurus that enables us to translate words into synonyms or into upper concepts. In this paper, we employ the method to translate words using Chinese characters or continuous letters of Katakana more then one katakana letter (Japanese alphabet letter) into keywords. The method realizes the high speed of processing without any dictionary for separating words. Fuzzy multivariate analysis is employed to analyze such processed data and to abstract a latent mutual related structure under the data. In other words, we abstract the knowledge from the given text data. At the end we apply the method to mining the text information of libraries and Web pages distributed over a web network and discussing about the application to Kansei engineering.

    AB - Recently, various types of data are expected to get in information processing according to multi-media technology. Especially, linguistic data are employed in fuzzy systems as well as fuzzy numerical values. In this paper we propose a text minig method based on fuzzy quantification model. In the process of text mining, we will pursue the following steps: 1) Sentences included in a text in Japanese are broken down into words. 2) It is possible to realize common understanding using fuzzy thesaurus that enables us to translate words into synonyms or into upper concepts. In this paper, we employ the method to translate words using Chinese characters or continuous letters of Katakana more then one katakana letter (Japanese alphabet letter) into keywords. The method realizes the high speed of processing without any dictionary for separating words. Fuzzy multivariate analysis is employed to analyze such processed data and to abstract a latent mutual related structure under the data. In other words, we abstract the knowledge from the given text data. At the end we apply the method to mining the text information of libraries and Web pages distributed over a web network and discussing about the application to Kansei engineering.

    KW - Fuzzy quantification analysis

    KW - Library data

    KW - Text mining

    UR - http://www.scopus.com/inward/record.url?scp=33745304208&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=33745304208&partnerID=8YFLogxK

    M3 - Conference contribution

    AN - SCOPUS:33745304208

    SN - 3540288961

    SN - 9783540288961

    VL - 3683 LNAI

    T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

    SP - 1122

    EP - 1128

    BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

    ER -