Hierarchical multilabel classifier for gene ontology annotation using multihead and multiend deep CNN model

Xin Yuan, Erli Pang, Kui Lin, Jinglu Hu

Research output: Contribution to journalArticle

Abstract

Gene ontology annotation is known to be a very complicated multilabel classification task, and the hierarchical multilabel classification (HMC) approaches with local classifiers have been shown to be effective for the task. In a traditional HMC method, a set of hierarchically organized simple local classifiers are usually used, each of which for one hierarchical level separately. In this paper, we propose a novel hierarchical multilabel classifier implementing the whole set of hierarchically organized local classifiers in one deep convolution neural network (CNN) model with multiple heads and multiple ends (MHME). The proposed MHME CNN model consists of three parts: the body part of a deep CNN model shared by different local classifiers for feature extraction and feature mapping; the multiend part of a set of autoencoders performing feature fusion transforming the input vectors of different local classifiers to feature vectors with the same length so as to share the feature mapping part; and the multihead part of a set of linear multilabel classifiers. Furthermore, a sophisticated recursive algorithm is designed to train the MHME CNN model to realize the functions of a set of hierarchically organized local classifiers. In this way, by sharing a deep CNN with multiple local classifiers, we are able to construct more powerful local classifiers for each level with limited training samples, and to achieve better classification performance. Experiment results on various benchmark datasets show that the proposed deep CNN-based model has better performance than the state-of-the-art traditional models. Moreover, it gives rather good performance even under a transfer learning.

Original languageEnglish
Pages (from-to)1057-1064
Number of pages8
JournalIEEJ Transactions on Electrical and Electronic Engineering
Volume15
Issue number7
DOIs
Publication statusPublished - 2020 Jul 1

Keywords

  • autoencoder
  • deep learning
  • feature extraction
  • GO annotation
  • hierarchical multilabel classification

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Hierarchical multilabel classifier for gene ontology annotation using multihead and multiend deep CNN model'. Together they form a unique fingerprint.

  • Cite this