TY - GEN
T1 - Improvement for action strategy learning in classification task using classification probalilities
AU - Kim, Chyon Hae
AU - Yamazaki, Shota
AU - Tsujino, Hiroshi
AU - Sugano, Shigeki
PY - 2012
Y1 - 2012
N2 - In this paper, we address the autonomous evidence accumulation when a system classifies an object into one of predetermined categories. We propose a reinforcement learning system that effectively selects actions to speed up the classification process. The proposed system accelerates its learning using classification probabilities calculated by a classification system. We conducted three binary classification experiments to evaluate the learning speed and correctness of the proposed system. In the first experiment, we examined a random action selection strategy that does not learn its selection parameters while accumulating evidence. In the second experiment, we examined Paletta's reinforcement learning system that observes the state of the object and learns action selection strategy. In the third experiment, we examined the proposed system that observes both the object state and the classification probability. The proposed system showed the fastest learning.
AB - In this paper, we address the autonomous evidence accumulation when a system classifies an object into one of predetermined categories. We propose a reinforcement learning system that effectively selects actions to speed up the classification process. The proposed system accelerates its learning using classification probabilities calculated by a classification system. We conducted three binary classification experiments to evaluate the learning speed and correctness of the proposed system. In the first experiment, we examined a random action selection strategy that does not learn its selection parameters while accumulating evidence. In the second experiment, we examined Paletta's reinforcement learning system that observes the state of the object and learns action selection strategy. In the third experiment, we examined the proposed system that observes both the object state and the classification probability. The proposed system showed the fastest learning.
UR - http://www.scopus.com/inward/record.url?scp=84877814194&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84877814194&partnerID=8YFLogxK
U2 - 10.1109/SCIS-ISIS.2012.6505012
DO - 10.1109/SCIS-ISIS.2012.6505012
M3 - Conference contribution
AN - SCOPUS:84877814194
SN - 9781467327428
T3 - 6th International Conference on Soft Computing and Intelligent Systems, and 13th International Symposium on Advanced Intelligence Systems, SCIS/ISIS 2012
SP - 352
EP - 358
BT - 6th International Conference on Soft Computing and Intelligent Systems, and 13th International Symposium on Advanced Intelligence Systems, SCIS/ISIS 2012
T2 - 2012 Joint 6th International Conference on Soft Computing and Intelligent Systems, SCIS 2012 and 13th International Symposium on Advanced Intelligence Systems, ISIS 2012
Y2 - 20 November 2012 through 24 November 2012
ER -