The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

Costa OLV; Dufour F

Applied Mathematics and Optimization, Vol.62, No.2, 185-204, 2010

DOI10.1007/s00245-010-9099-4 Export Citation

The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.

Keywords:Piecewise-deterministic Markov Processes;Continuous-time;Long-run average cost;Optimal control;Integro-differential optimality inequation;Policy iteration algorithm