IEEE Transactions on Automatic Control, Vol. 55, No. 5, pp. 1101-1116, 2010
Solving Continuous-State POMDPs via Density Projection
Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on finite-state models, and these algorithms do not generally extend to continuous-state POMDPs, due to the infinite dimensionality of the belief space. In this paper, we develop a computationally viable and theoretically sound method for solving continuous-state POMDPs by effectively reducing the dimensionality of the belief space via density projection. The density projection technique is also incorporated into particle filtering to provide a filtering scheme for online decision making. We provide an error bound between the value function induced by the policy obtained by our method and the true value function of the POMDP, and also an error bound between projection particle filtering and exact filtering. Finally, we illustrate the effectiveness of our method through an inventory control problem.
Keywords: Belief state; decision making; density projection; partially observable Markov decision processes (POMDPs); particle filtering; value function
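To make the idea of combining density projection with particle filtering concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a scalar state, a Gaussian projection family, and a projection that minimizes Kullback-Leibler divergence, which for a Gaussian reduces to matching the mean and variance. The names `project_to_gaussian`, `projection_filter_step`, `transition`, and `likelihood` are hypothetical and do not come from the paper.

```python
import numpy as np

def project_to_gaussian(particles, weights):
    """Project a weighted particle set onto a Gaussian by moment matching.

    Assumption: the projection family is Gaussian, so the belief is
    summarized by two numbers (mean, variance) instead of a full density.
    """
    particles = np.asarray(particles, dtype=float)
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()          # normalize the weights
    mean = float(np.sum(weights * particles))
    var = float(np.sum(weights * (particles - mean) ** 2))
    return mean, var

def projection_filter_step(mean, var, observation, transition, likelihood,
                           n_particles, rng):
    """One filtering step: sample, propagate, weight, then project.

    `transition(particles, rng)` and `likelihood(observation, particles)`
    are hypothetical user-supplied model functions.
    """
    # Sample particles from the current projected (Gaussian) belief.
    particles = rng.normal(mean, np.sqrt(var), size=n_particles)
    # Propagate each particle through the state dynamics.
    particles = transition(particles, rng)
    # Weight each particle by how well it explains the observation.
    weights = likelihood(observation, particles)
    # Project the weighted empirical belief back onto the Gaussian family.
    return project_to_gaussian(particles, weights)
```

Under these assumptions the belief is carried between steps as a pair of parameters rather than a particle cloud, which is what makes a low-dimensional belief representation possible; other parametric families would require matching their own sufficient statistics instead.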