Cooperative target searching and tracking via UCT with probability distribution model
Abstract
As Unmanned Aerial Vehicle's (UAV) battery life and stability develop, multiple UAVs are having more and more applications in the uninterrupted patrol and security. Thus UAV's searching, tracking and trajectory planning become important issues. This paper proposes an online distributed algorithm used in UAV's tracking and searching, with the consideration of UAV's practical need to recharge under limited power. We propose a Quantum Probability Model to describe the partially observable target positions, and we use Upper Confidence Tree (UCT) algorithm to find out the best searching and tracking route based on this model. We also introduce the Teammate Learning Model to handle the nonstationary problems in distributed reinforcement learning.