IEEE Transactions on Automatic Control, Vol.49, No.10, 1779-1782, 2004
The Gauss-Seidel numerical procedure for Markov stochastic games
Consider the problem of value iteration for solving Markov stochastic games. One simply iterates backward, via a Jacobi-like procedure. The convergence of the Gauss-Seidel form of this procedure is shown for both the discounted and ergodic cost problems, under appropriate conditions, with extensions to problems where one stops when a boundary is hit or if any one of the players chooses to stop, with associated costs. Generally, the Gauss-Seidel procedure accelerates convergence.