A new segmented oversampling method for imbalanced data classification using quasi-linear SVM

Bo Zhou, Weite Li, Jinglu Hu*

*Corresponding author for this work

Research output: Article, peer-reviewed

2 citations (Scopus)

Abstract

Data imbalance occurs in most real-world classification problems and decreases the performance of classifiers. An oversampling method addresses the imbalance problem by generating synthetic samples to balance the data distribution. However, many existing oversampling methods may generate wrong or unnecessary synthetic samples, which makes the learning task more difficult. This paper proposes a new segmented oversampling method for imbalanced data classification. The input space is first partitioned into several linearly separable local partitions along the potential separation boundary by introducing a bottom-up, minimal-spanning-tree-based clustering method; an oversampling method is then applied within each local linear partition to prevent the generation of wrong and unnecessary synthetic samples; finally, a quasi-linear support vector machine performs the classification by taking advantage of the local linear partitions. Simulation results on several real-world datasets show that the proposed segmented oversampling method is effective for imbalanced data classification.
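The second step of the abstract, oversampling inside each local partition, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which oversampling method is used, so a SMOTE-style linear interpolation between two minority samples is assumed here, and the partition labels are taken as given rather than computed by the minimal-spanning-tree-based clustering. The function name segmented_oversample and all of its parameters are hypothetical.

```python
import random


def segmented_oversample(X, y, partition, minority=1, seed=0):
    """Generate synthetic minority samples separately inside each partition.

    Assumptions (not taken from the paper): interpolation between two
    minority samples of the same partition stands in for the oversampling
    method, and `partition` already holds one partition id per sample.
    """
    rng = random.Random(seed)
    synthetic, synthetic_part = [], []
    for p in sorted(set(partition)):
        idx = [i for i, q in enumerate(partition) if q == p]
        minor = [X[i] for i in idx if y[i] == minority]
        n_major = sum(1 for i in idx if y[i] != minority)
        deficit = n_major - len(minor)
        # Skip partitions that are already balanced or have too few
        # minority samples to interpolate between.
        if deficit <= 0 or len(minor) < 2:
            continue
        for _ in range(deficit):
            a, b = rng.sample(minor, 2)  # two distinct minority samples
            t = rng.random()             # position on the segment a -> b
            synthetic.append([ai + t * (bi - ai) for ai, bi in zip(a, b)])
            synthetic_part.append(p)
    return synthetic, synthetic_part


# Toy data: two well-separated partitions, minority class labelled 1.
X = [[0, 0], [1, 0], [0.5, 1.0], [0.2, 0.8], [0.9, 0.4],
     [10, 10], [11, 10], [10.5, 11], [10.2, 10.6], [10.8, 10.1], [10.4, 10.9]]
y = [1, 1, 0, 0, 0,
     1, 1, 0, 0, 0, 0]
partition = [0, 0, 0, 0, 0,
             1, 1, 1, 1, 1, 1]

synthetic, synthetic_part = segmented_oversample(X, y, partition)
```

Because both endpoints of every interpolation come from the same partition, no synthetic sample falls in the gap between the two partitions, which is the kind of wrong sample the abstract says the segmentation is meant to prevent. A classifier such as the quasi-linear SVM would then be trained on the original samples together with the synthetic ones.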

Original language: English
Pages (from-to): 891-898
Number of pages: 8
Journal: IEEJ Transactions on Electrical and Electronic Engineering
Volume: 12
Issue number: 6
DOI
Publication status: Published - Nov 2017

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
