Solving the imbalanced data classification problem with the particle swarm optimization based support vector machine

Zhenyuan Xu, Juilzo Watada, Mingnan Wu, Zuwarie Ibrahim, Marzuki Khalid

    Research output: Contribution to journalArticle

    4 Citations (Scopus)

    Abstract

    A database contains a wealth of hidden knowledge that can be used in decision making to support commerce, business, management, research and other activities. Classification analysis plays a pivotal role in the pattern recognition field, where it is considered as a core method. Algorithms such as support vector machine (SVM) and artificial neural network (ANN) have been proposed to solve the problem of binary classification according to data distributions. But these traditional classification algorithms are unable to provide satisfying results for an imbalanced dataset with special characters. In this paper, we propose a model based on particle swarm optimization (PSO) and support vector machine (SVM) for using in the classification of a large, imbalanced dataset. This model is referred to as the PSO-SVM (particle swarm optimization-based support vector machine) model. PSO was recently proposed as a metaheuristic framework for large, imbalanced dataset classification. The SVM algorithm also exhibits a high level of performance in handling balanced binary classification. Therefore, the novel model proposed here is introduced to improve classification accuracy by combining support vector classification (SVC) with an imbalanced PSO. The G-mean is used to evaluate the final results. In the final section of this paper, the proposed method is compared with some conventional heuristic models. The experimental results demonstrate that the proposed method exhibits a high level of performance for imbalanced dataset classification.

    Original languageEnglish
    Pages (from-to)788-795
    Number of pages8
    JournalIEEJ Transactions on Electronics, Information and Systems
    Volume134
    Issue number6
    DOIs
    Publication statusPublished - 2014

    Keywords

    • Imbalanced dataset classification
    • Particle swarm optimization (PSO)
    • Particle swarm optimization-based support vector machine (PSO-SVM)
    • Support vector classification (SVC)

    ASJC Scopus subject areas

    • Electrical and Electronic Engineering

    Fingerprint Dive into the research topics of 'Solving the imbalanced data classification problem with the particle swarm optimization based support vector machine'. Together they form a unique fingerprint.

  • Cite this