The tracking problem is a popular benchmark for evaluating reinforcement learning. In the tracking problem, several hunters pursue a target and try to catch it in as few steps as possible. In this paper, we propose separating the decision-making process of reinforcement learning into two levels: strategic decisions and tactical decisions. The strategic decision determines the movement policy of the hunters, and the tactical decision determines the movement direction of each hunter. Experimental results showed that our method could catch the target in 54% of the steps required by conventional reinforcement learning.
Journal title
宮崎大學工學部紀要 (Memoirs of Faculty of Engineering, University of Miyazaki)
Volume
45
Pages
221-225
Date of publication
2016-07-29
Publisher
宮崎大学工学部
Miyazaki University
Faculty of Engineering, University of Miyazaki