A SVM-RBF method for solving imbalanced data problem

Lei Ding, Junzo Watada, Lim Chun Chew, Zuwairie Ibrahim, Lee Wen Jau, Marzuki Khalid

    Research output: Contribution to journalArticle

    9 Citations (Scopus)

    Abstract

    Although knowledge discovery and data mining techniques have successfully resolved a lot of real-world applications, classifying an unbalanced data is still full of challenge. The imbalanced data distribution led to the results of classification emphasis on the majority class. In another word, the accuracy for the minority class is often quite low. Traditional classification methods such as ANN, KNN, cannot solve the problem effectively. How to improve the accuracy of unbalanced data classification has attracted growing attention from both academia and industry. The object of this paper is to build a fused method consisting of data scaling, a re-sampling technique and the SVM-RBF-based method to classify a large unbalanced data set obtained in semiconductor industry. It is a classifier model to predict an output result which will be used for producing health control. The experimental results showed that the classification accuracy of the minority class had a great improvement by using SVM-RBF Model. ICIC International

    Original languageEnglish
    Pages (from-to)2419-2424
    Number of pages6
    JournalICIC Express Letters
    Volume4
    Issue number6 B
    Publication statusPublished - 2010 Dec

      Fingerprint

    Keywords

    • Imbalanced data
    • RBF kernel function
    • SVM classifier

    ASJC Scopus subject areas

    • Computer Science(all)
    • Control and Systems Engineering

    Cite this

    Ding, L., Watada, J., Chew, L. C., Ibrahim, Z., Jau, L. W., & Khalid, M. (2010). A SVM-RBF method for solving imbalanced data problem. ICIC Express Letters, 4(6 B), 2419-2424.