A central concern of organizational learning is how an agent balances exploration and exploitation. When an agent must decide on an answer in finite time, a trade-off between exploring and exploiting arises. This trade-off never arises when enough time is available. When the time for a task is finite, however, the agent must decide on an answer from imperfect information. The two-armed bandit problem is often used to study this trade-off. In a previous article, we proposed a model of weak identity in which the agent dynamically changes the hierarchy of its knowledge. Recently, heuristic models have attracted attention as a way to approach this problem. In this article, we propose a different approach to the two-armed bandit problem. The proposed model adapts to the environment when the conditions of the problem change, and it offers another approach to organizational learning.