An extended probabilistic model building genetic network programming using both of good and bad individuals

Xianneng Li, Shingo Mabu, Kotaro Hirasawa

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

Classical estimation of distribution algorithms (EDAs) generally use truncation selection to estimate the distribution of the good individuals while ignoring the bad ones. However, various researches in evolutionary algorithms (EAs) have reported that the bad individuals may affect and help solving the problem. This paper proposes a new method to use the bad individuals by studying the substructures rather than the entire individual structures to solve reinforcement learning (RL) problems, which generally factorize their entire solutions to the sequences of state-action pairs. This work was studied in a recent graph-based EDA named probabilistic model building genetic network programming (PMBGNP), which could solve RL problems successfully, to propose an extended PMBGNP. The effectiveness of this work is verified in an RL problem, namely robot control. Compared to other related work, results show that the proposed method can significantly speed up the evolution efficiency.

Original languageEnglish
Pages (from-to)339-347
Number of pages9
JournalIEEJ Transactions on Electrical and Electronic Engineering
Volume8
Issue number4
DOIs
Publication statusPublished - 2013 Jul

    Fingerprint

Keywords

  • Bad individuals
  • Estimation of distribution algorithms (EDAs)
  • Probabilistic model building genetic network programming
  • Probabilistic modeling
  • Reinforcement learning

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this