Document classification method with small training data

Yasunari Maeda, Hideki Yoshida, Toshiyasu Matsushima

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Document classification is one of important topics in the field of NLP(Natural Language Processing). In our previous research we've proposed a document classification method which minimizes an error rate with reference to a Bayes criterion. But when the number of documents in training data is small, the accuracy of the previous method is low. So in this research we propose a document classification method whose accuracy is higher than the previous method when the number of documents in training data is small.

    Original languageEnglish
    Title of host publicationICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings
    Pages138-141
    Number of pages4
    Publication statusPublished - 2009
    EventICROS-SICE International Joint Conference 2009, ICCAS-SICE 2009 - Fukuoka
    Duration: 2009 Aug 182009 Aug 21

    Other

    OtherICROS-SICE International Joint Conference 2009, ICCAS-SICE 2009
    CityFukuoka
    Period09/8/1809/8/21

    Fingerprint

    Processing

    Keywords

    • Document classification
    • Estimating data
    • Prior distributions
    • Small training data

    ASJC Scopus subject areas

    • Information Systems
    • Control and Systems Engineering
    • Industrial and Manufacturing Engineering

    Cite this

    Maeda, Y., Yoshida, H., & Matsushima, T. (2009). Document classification method with small training data. In ICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings (pp. 138-141). [5333327]

    Document classification method with small training data. / Maeda, Yasunari; Yoshida, Hideki; Matsushima, Toshiyasu.

    ICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings. 2009. p. 138-141 5333327.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Maeda, Y, Yoshida, H & Matsushima, T 2009, Document classification method with small training data. in ICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings., 5333327, pp. 138-141, ICROS-SICE International Joint Conference 2009, ICCAS-SICE 2009, Fukuoka, 09/8/18.
    Maeda Y, Yoshida H, Matsushima T. Document classification method with small training data. In ICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings. 2009. p. 138-141. 5333327
    Maeda, Yasunari ; Yoshida, Hideki ; Matsushima, Toshiyasu. / Document classification method with small training data. ICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings. 2009. pp. 138-141
    @inproceedings{b2c142d21f214b63a2557bb48254cc8e,
    title = "Document classification method with small training data",
    abstract = "Document classification is one of important topics in the field of NLP(Natural Language Processing). In our previous research we've proposed a document classification method which minimizes an error rate with reference to a Bayes criterion. But when the number of documents in training data is small, the accuracy of the previous method is low. So in this research we propose a document classification method whose accuracy is higher than the previous method when the number of documents in training data is small.",
    keywords = "Document classification, Estimating data, Prior distributions, Small training data",
    author = "Yasunari Maeda and Hideki Yoshida and Toshiyasu Matsushima",
    year = "2009",
    language = "English",
    isbn = "9784907764333",
    pages = "138--141",
    booktitle = "ICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings",

    }

    TY - GEN

    T1 - Document classification method with small training data

    AU - Maeda, Yasunari

    AU - Yoshida, Hideki

    AU - Matsushima, Toshiyasu

    PY - 2009

    Y1 - 2009

    N2 - Document classification is one of important topics in the field of NLP(Natural Language Processing). In our previous research we've proposed a document classification method which minimizes an error rate with reference to a Bayes criterion. But when the number of documents in training data is small, the accuracy of the previous method is low. So in this research we propose a document classification method whose accuracy is higher than the previous method when the number of documents in training data is small.

    AB - Document classification is one of important topics in the field of NLP(Natural Language Processing). In our previous research we've proposed a document classification method which minimizes an error rate with reference to a Bayes criterion. But when the number of documents in training data is small, the accuracy of the previous method is low. So in this research we propose a document classification method whose accuracy is higher than the previous method when the number of documents in training data is small.

    KW - Document classification

    KW - Estimating data

    KW - Prior distributions

    KW - Small training data

    UR - http://www.scopus.com/inward/record.url?scp=77951142189&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=77951142189&partnerID=8YFLogxK

    M3 - Conference contribution

    SN - 9784907764333

    SP - 138

    EP - 141

    BT - ICCAS-SICE 2009 - ICROS-SICE International Joint Conference 2009, Proceedings

    ER -