Abstract
Classical estimation of distribution algorithms (EDAs) generally use truncation selection to estimate the distribution of the good individuals while ignoring the bad ones. However, several studies on evolutionary algorithms (EAs) have reported that bad individuals can influence the search and help solve the problem. This paper proposes a new method that exploits bad individuals by studying their substructures rather than entire individual structures when solving reinforcement learning (RL) problems, whose solutions generally factorize into sequences of state-action pairs. The method is developed within a recent graph-based EDA, probabilistic model building genetic network programming (PMBGNP), which has solved RL problems successfully, and yields an extended PMBGNP. Its effectiveness is verified on an RL problem, namely robot control. Compared with related work, the results show that the proposed method significantly improves evolution efficiency.
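
The contrast drawn in the abstract, between estimating a distribution from the good individuals only and also using the bad ones, can be illustrated with a toy sketch. This is not the PMBGNP method, which works on graph structures and state-action substructures. It is a minimal univariate EDA on bit strings, assuming a OneMax fitness function; the names `eda_step`, `beta`, and `floor`, and the update rule that pushes each marginal away from the frequencies observed in the worst individuals, are hypothetical choices made for illustration only.

```python
import random


def onemax(bits):
    """Toy fitness (an assumption for this sketch): number of 1-bits."""
    return sum(bits)


def marginal_frequencies(individuals, n_bits):
    """Per-position frequency of 1-bits within a group of individuals."""
    return [sum(ind[i] for ind in individuals) / len(individuals)
            for i in range(n_bits)]


def eda_step(probs, pop_size, n_select, rng, beta=0.0, floor=0.05):
    """One generation of a univariate EDA with truncation selection.

    beta == 0.0: classical update, marginals come from the best n_select only.
    beta > 0.0: hypothetical rule that also moves each marginal away from the
    frequencies seen in the worst n_select individuals.
    """
    n_bits = len(probs)
    # Sample a population from the current per-bit probabilities.
    population = [[1 if rng.random() < p else 0 for p in probs]
                  for _ in range(pop_size)]
    # Truncation selection: rank by fitness, best first.
    population.sort(key=onemax, reverse=True)
    good = marginal_frequencies(population[:n_select], n_bits)
    bad = marginal_frequencies(population[-n_select:], n_bits)
    new_probs = []
    for g, b in zip(good, bad):
        p = g + beta * (g - b)
        # Clamp so that no bit value becomes impossible to sample.
        new_probs.append(min(1.0 - floor, max(floor, p)))
    return new_probs, onemax(population[0])


def run(n_bits=20, pop_size=50, n_select=10, generations=30, beta=0.5, seed=0):
    """Run the sketch and return the final marginals and best fitness seen."""
    rng = random.Random(seed)
    probs = [0.5] * n_bits
    best = 0
    for _ in range(generations):
        probs, gen_best = eda_step(probs, pop_size, n_select, rng, beta)
        best = max(best, gen_best)
    return probs, best
```

Calling `run(beta=0.0)` gives the classical good-individuals-only update, while `run(beta=0.5)` adds the bad-individual term, so the two settings can be compared on the same seed. Any difference observed on this toy problem says nothing about the results reported in the paper.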
Original language | English |
---|---|
Pages (from-to) | 339-347 |
Number of pages | 9 |
Journal | IEEJ Transactions on Electrical and Electronic Engineering |
Volume | 8 |
Issue number | 4 |
Publication status | Published - 2013 Jul |
Keywords
- Bad individuals
- Estimation of distribution algorithms (EDAs)
- Probabilistic model building genetic network programming
- Probabilistic modeling
- Reinforcement learning
ASJC Scopus subject areas
- Electrical and Electronic Engineering