A hybrid feature selection algorithm for gene expression data classification

Huijuan Lu, Junying Chen, Ke Yan, Qun Jin, Yu Xue, Zhigang Gao

    Research output: Contribution to journalArticle

    61 Citations (Scopus)

    Abstract

    In the DNA microarray research field, the increasing sample size and feature dimension of the gene expression data prompt the development of an efficient and robust feature selection algorithm for gene expression data classification. In this study, we propose a hybrid feature selection algorithm that combines the mutual information maximization (MIM) and the adaptive genetic algorithm (AGA). Experimental results show that the proposing MIMAGA-Selection method significantly reduces the dimension of gene expression data and removes the redundancies for classification. The reduced gene expression dataset provides highest classification accuracy compared to conventional feature selection algorithms. We also apply four different classifiers to the reduced dataset to demonstrate the robustness of the proposed MIMAGA-Selection algorithm.

    Original languageEnglish
    JournalNeurocomputing
    DOIs
    Publication statusAccepted/In press - 2016 May 14

    Fingerprint

    Gene expression
    Feature extraction
    Gene Expression
    Microarrays
    Adaptive algorithms
    Redundancy
    Oligonucleotide Array Sequence Analysis
    DNA
    Classifiers
    Genetic algorithms
    Sample Size
    Research
    Datasets

    Keywords

    • Adaptive genetic algorithm
    • Feature selection
    • Gene expression data
    • Mutual information maximization

    ASJC Scopus subject areas

    • Computer Science Applications
    • Cognitive Neuroscience
    • Artificial Intelligence

    Cite this

    A hybrid feature selection algorithm for gene expression data classification. / Lu, Huijuan; Chen, Junying; Yan, Ke; Jin, Qun; Xue, Yu; Gao, Zhigang.

    In: Neurocomputing, 14.05.2016.

    Research output: Contribution to journalArticle

    Lu, Huijuan ; Chen, Junying ; Yan, Ke ; Jin, Qun ; Xue, Yu ; Gao, Zhigang. / A hybrid feature selection algorithm for gene expression data classification. In: Neurocomputing. 2016.
    @article{373bcc2c6a5a4633b6a3aa0b289ce6f6,
    title = "A hybrid feature selection algorithm for gene expression data classification",
    abstract = "In the DNA microarray research field, the increasing sample size and feature dimension of the gene expression data prompt the development of an efficient and robust feature selection algorithm for gene expression data classification. In this study, we propose a hybrid feature selection algorithm that combines the mutual information maximization (MIM) and the adaptive genetic algorithm (AGA). Experimental results show that the proposing MIMAGA-Selection method significantly reduces the dimension of gene expression data and removes the redundancies for classification. The reduced gene expression dataset provides highest classification accuracy compared to conventional feature selection algorithms. We also apply four different classifiers to the reduced dataset to demonstrate the robustness of the proposed MIMAGA-Selection algorithm.",
    keywords = "Adaptive genetic algorithm, Feature selection, Gene expression data, Mutual information maximization",
    author = "Huijuan Lu and Junying Chen and Ke Yan and Qun Jin and Yu Xue and Zhigang Gao",
    year = "2016",
    month = "5",
    day = "14",
    doi = "10.1016/j.neucom.2016.07.080",
    language = "English",
    journal = "Neurocomputing",
    issn = "0925-2312",
    publisher = "Elsevier",

    }

    TY - JOUR

    T1 - A hybrid feature selection algorithm for gene expression data classification

    AU - Lu, Huijuan

    AU - Chen, Junying

    AU - Yan, Ke

    AU - Jin, Qun

    AU - Xue, Yu

    AU - Gao, Zhigang

    PY - 2016/5/14

    Y1 - 2016/5/14

    N2 - In the DNA microarray research field, the increasing sample size and feature dimension of the gene expression data prompt the development of an efficient and robust feature selection algorithm for gene expression data classification. In this study, we propose a hybrid feature selection algorithm that combines the mutual information maximization (MIM) and the adaptive genetic algorithm (AGA). Experimental results show that the proposing MIMAGA-Selection method significantly reduces the dimension of gene expression data and removes the redundancies for classification. The reduced gene expression dataset provides highest classification accuracy compared to conventional feature selection algorithms. We also apply four different classifiers to the reduced dataset to demonstrate the robustness of the proposed MIMAGA-Selection algorithm.

    AB - In the DNA microarray research field, the increasing sample size and feature dimension of the gene expression data prompt the development of an efficient and robust feature selection algorithm for gene expression data classification. In this study, we propose a hybrid feature selection algorithm that combines the mutual information maximization (MIM) and the adaptive genetic algorithm (AGA). Experimental results show that the proposing MIMAGA-Selection method significantly reduces the dimension of gene expression data and removes the redundancies for classification. The reduced gene expression dataset provides highest classification accuracy compared to conventional feature selection algorithms. We also apply four different classifiers to the reduced dataset to demonstrate the robustness of the proposed MIMAGA-Selection algorithm.

    KW - Adaptive genetic algorithm

    KW - Feature selection

    KW - Gene expression data

    KW - Mutual information maximization

    UR - http://www.scopus.com/inward/record.url?scp=85014772083&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85014772083&partnerID=8YFLogxK

    U2 - 10.1016/j.neucom.2016.07.080

    DO - 10.1016/j.neucom.2016.07.080

    M3 - Article

    JO - Neurocomputing

    JF - Neurocomputing

    SN - 0925-2312

    ER -