A quasi-linear SVM combined with assembled SMOTE for imbalanced data classification

Bo Zhou, Cheng Yang, Haixiang Guo, Jinglu Hu

研究成果: Conference contribution

20 被引用数 (Scopus)

抄録

This paper focuses on imbalanced dataset classification problem by using SVM and oversampling method. Traditional oversampling method increases the occurrence of over-lapping between classes, which leads to poor generalization of SVM classification. To solve this problem this paper proposes a combined method of quasi-linear SVM and assembled SMOTE. The quasi-linear SVM is an SVM with quasi-linear kernel function. It realizes an approximate nonlinear separation boundary by mulit-local linear boundaries with interpolation. The assembled SMOTE implements oversampling with considering of the data distribution information and avoids occurrence of overlapping between classes. Firstly, a partition method based on Minimal Spanning Tree is proposed to obtain local linear partitions, each of which can be separated with one linear separation boundary. Secondly, using the information of local linear partitions, the assembled SMOTE generates synthetic minority class samples. Finally, the quasi-linear SVM realizes a classification of oversampled datasets in the same way as a standard SVM by using a composite quasi-linear kernel function. Experiment results on artificial data and benchmark datasets show that the proposed method is effective and improves classification performances.

本文言語English
ホスト出版物のタイトル2013 International Joint Conference on Neural Networks, IJCNN 2013
DOI
出版ステータスPublished - 2013
イベント2013 International Joint Conference on Neural Networks, IJCNN 2013 - Dallas, TX, United States
継続期間: 2013 8月 42013 8月 9

出版物シリーズ

名前Proceedings of the International Joint Conference on Neural Networks

Conference

Conference2013 International Joint Conference on Neural Networks, IJCNN 2013
国/地域United States
CityDallas, TX
Period13/8/413/8/9

ASJC Scopus subject areas

  • ソフトウェア
  • 人工知能

フィンガープリント

「A quasi-linear SVM combined with assembled SMOTE for imbalanced data classification」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル