Quasi-linear SVM classifier with segmented local offsets for imbalanced data classification

Peifeng Liang, Feng Zheng, Weite Li, Takayuki Furuzuki

Research output: Contribution to journal › Article

1 Citation (Scopus)

Abstract

Within-class imbalance often occurs in imbalanced data classification; it worsens the imbalanced-distribution problem and increases the complexity of the concept to be learned. However, most existing methods for imbalanced data classification focus on rectifying between-class imbalance, which is insufficient and inappropriate in many scenarios. This paper proposes a simple yet effective support vector machine (SVM) classifier with local offset adjustment for imbalanced classification problems. First, a geometry-based partitioning method is modified for imbalanced datasets to divide the input space into multiple linearly separable partitions along the potential separation boundary. Then an F-score-based method is applied to obtain a local offset optimized on each local cluster. Finally, by constructing a quasi-linear kernel from the partitioning information, a quasi-linear SVM classifier with local offsets is built for the imbalanced datasets. Simulation results on several real-world datasets show that the proposed method is effective for imbalanced data classification.
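The abstract names two building blocks without giving formulas: a quasi-linear kernel composed from partitioning information, and an F-score-based choice of local offsets. The sketch below illustrates one plausible reading of each, not the authors' implementation. Everything specific in it is an assumption: the normalized-RBF gating in `gate_weights`, the kernel form (a linear kernel multiplied by the overlap of partition memberships), the `width` parameter, the grid of candidate offsets, and all function names are hypothetical.

```python
import math

def gate_weights(x, centers, width=1.0):
    """Normalized RBF membership of x in each partition (assumed gating form)."""
    raw = [
        math.exp(-sum((a - c) ** 2 for a, c in zip(x, ctr)) / (2 * width ** 2))
        for ctr in centers
    ]
    total = sum(raw)
    return [r / total for r in raw]

def quasi_linear_kernel(x, z, centers, width=1.0):
    """Assumed composition: linear kernel scaled by shared partition membership."""
    linear = 1.0 + sum(a * b for a, b in zip(x, z))
    gx = gate_weights(x, centers, width)
    gz = gate_weights(z, centers, width)
    return linear * sum(p * q for p, q in zip(gx, gz))

def f_score(y_true, y_pred, positive=1):
    """F1 score of the positive (here: minority) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def best_local_offset(scores, labels, candidates):
    """Grid-search the offset b that maximizes F-score of sign(score + b)
    over the samples of one partition (hypothetical search strategy)."""
    best_b, best_f = 0.0, -1.0
    for b in candidates:
        preds = [1 if s + b > 0 else -1 for s in scores]
        f = f_score(labels, preds)
        if f > best_f:
            best_b, best_f = b, f
    return best_b, best_f

# Toy partition: the minority class (+1) is under-scored by the global SVM.
scores = [-0.4, -0.2, 0.3, -0.6, -0.9]   # made-up decision values
labels = [1, 1, 1, -1, -1]
offset, score = best_local_offset(scores, labels, [-0.5, 0.0, 0.5, 1.0])
# selects offset = 0.5 here, with F-score 1.0

# Two made-up partition centers; points near the same center share weight.
centers = [(0.0, 0.0), (4.0, 4.0)]
k_same = quasi_linear_kernel((0.1, 0.2), (0.2, 0.1), centers)
k_diff = quasi_linear_kernel((0.1, 0.2), (3.9, 4.1), centers)
```

In this reading, the gating term makes the kernel behave like a linear kernel inside a partition and suppresses interaction across partitions, while the per-partition offset shifts the decision threshold toward whichever side improves the local F-score. A product of a linear kernel and a gate-overlap kernel is itself a valid kernel, so such a matrix could be passed to any SVM solver that accepts precomputed kernels.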

Original language: English
Journal: IEEJ Transactions on Electrical and Electronic Engineering
DOI: 10.1002/tee.22808
Publication status: Accepted/In press - 2018 Jan 1

Fingerprint

Support vector machines
Classifiers
Geometry

Keywords

  • imbalanced data classification
  • kernel composition
  • local linear partition
  • local offset method
  • support vector machine
  • within-class imbalances

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

@article{da5f29a03fa54cc2b0f7f642a3b417ec,
title = "Quasi-linear SVM classifier with segmented local offsets for imbalanced data classification",
abstract = "Within-class imbalance problems often occur in imbalanced data classification, which worsen the imbalance distribution problem and increase the learning concept complexity. However, most existing methods for the imbalanced data classification focus on rectifying the between-class imbalance problem, which is insufficient and inappropriate in many different scenarios. This paper proposes a simple yet effective support vector machine (SVM) classifier with local offset adjustment for imbalance classification problems. First, a geometry-based partitioning method is modified for imbalanced datasets to divide the input space into multiple linearly separable partitions along the potential separation boundary. Then an F-score-based method is applied to obtain local offsets optimized on each local cluster. Finally, by constructing a quasi-linear kernel based on the partitioning information, a quasi-linear SVM classifier with local offsets is constructed for the imbalanced datasets. Simulation results on different real-world datasets show that the proposed method is effective for imbalanced data classifications.",
keywords = "imbalanced data classification, kernel composition, local linear partition, local offset method, support vector machine, within-class imbalances",
author = "Peifeng Liang and Feng Zheng and Weite Li and Takayuki Furuzuki",
year = "2018",
month = "1",
day = "1",
doi = "10.1002/tee.22808",
language = "English",
journal = "IEEJ Transactions on Electrical and Electronic Engineering",
issn = "1931-4973",
publisher = "John Wiley and Sons Inc.",

}

TY - JOUR

T1 - Quasi-linear SVM classifier with segmented local offsets for imbalanced data classification

AU - Liang, Peifeng

AU - Zheng, Feng

AU - Li, Weite

AU - Furuzuki, Takayuki

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Within-class imbalance problems often occur in imbalanced data classification, which worsen the imbalance distribution problem and increase the learning concept complexity. However, most existing methods for the imbalanced data classification focus on rectifying the between-class imbalance problem, which is insufficient and inappropriate in many different scenarios. This paper proposes a simple yet effective support vector machine (SVM) classifier with local offset adjustment for imbalance classification problems. First, a geometry-based partitioning method is modified for imbalanced datasets to divide the input space into multiple linearly separable partitions along the potential separation boundary. Then an F-score-based method is applied to obtain local offsets optimized on each local cluster. Finally, by constructing a quasi-linear kernel based on the partitioning information, a quasi-linear SVM classifier with local offsets is constructed for the imbalanced datasets. Simulation results on different real-world datasets show that the proposed method is effective for imbalanced data classifications.

KW - imbalanced data classification

KW - kernel composition

KW - local linear partition

KW - local offset method

KW - support vector machine

KW - within-class imbalances

UR - http://www.scopus.com/inward/record.url?scp=85054555903&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85054555903&partnerID=8YFLogxK

U2 - 10.1002/tee.22808

DO - 10.1002/tee.22808

M3 - Article

AN - SCOPUS:85054555903

JO - IEEJ Transactions on Electrical and Electronic Engineering

JF - IEEJ Transactions on Electrical and Electronic Engineering

SN - 1931-4973

ER -