Feature subset selection

A correlation-based SVM filter approach

Boyang Li, Qiangwei Wang, Takayuki Furuzuki

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions.

Original languageEnglish
Pages (from-to)173-179
Number of pages7
JournalIEEJ Transactions on Electrical and Electronic Engineering
Volume6
Issue number2
DOIs
Publication statusPublished - 2011 Mar

Fingerprint

Support vector machines
Feature extraction
Redundancy

Keywords

  • Correlation-based clustering
  • Feature ranking
  • Feature selection
  • Support vector machine

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

Feature subset selection : A correlation-based SVM filter approach. / Li, Boyang; Wang, Qiangwei; Furuzuki, Takayuki.

In: IEEJ Transactions on Electrical and Electronic Engineering, Vol. 6, No. 2, 03.2011, p. 173-179.

Research output: Contribution to journalArticle

@article{642b3598fab54d49be2dc5f658e92ede,
title = "Feature subset selection: A correlation-based SVM filter approach",
abstract = "The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions.",
keywords = "Correlation-based clustering, Feature ranking, Feature selection, Support vector machine",
author = "Boyang Li and Qiangwei Wang and Takayuki Furuzuki",
year = "2011",
month = "3",
doi = "10.1002/tee.20641",
language = "English",
volume = "6",
pages = "173--179",
journal = "IEEJ Transactions on Electrical and Electronic Engineering",
issn = "1931-4973",
publisher = "John Wiley and Sons Inc.",
number = "2",

}

TY - JOUR

T1 - Feature subset selection

T2 - A correlation-based SVM filter approach

AU - Li, Boyang

AU - Wang, Qiangwei

AU - Furuzuki, Takayuki

PY - 2011/3

Y1 - 2011/3

N2 - The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions.

AB - The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions.

KW - Correlation-based clustering

KW - Feature ranking

KW - Feature selection

KW - Support vector machine

UR - http://www.scopus.com/inward/record.url?scp=79551667272&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79551667272&partnerID=8YFLogxK

U2 - 10.1002/tee.20641

DO - 10.1002/tee.20641

M3 - Article

VL - 6

SP - 173

EP - 179

JO - IEEJ Transactions on Electrical and Electronic Engineering

JF - IEEJ Transactions on Electrical and Electronic Engineering

SN - 1931-4973

IS - 2

ER -