A turnpike theorem for a risk-sensitive Markov decision process with stopping

Denardo EV; Rothblum UG

SIAM Journal on Control and Optimization, Vol.45, No.2, 414-431, 2006

DOI10.1137/S0363012904442616 Export Citation

A turnpike theorem for a risk-sensitive Markov decision process with stopping

This paper concerns a Markov decision problem with an infinite planning horizon, stationary data, an exponential utility function, and no discounting, but also with the possibility of voluntary or involuntary termination ( stopping). This paper establishes conditions under which the expected utility of the income received by repeated use of a well-chosen stationary control equals the limit, as n becomes large, of the optimal expected utility for the n-period problem. Tests for these (turnpike) conditions are provided, as is a linear program whose optimal solution determines such a control.

Keywords:Markov control process;risk-sensitive dynamic programming;stopping;turnpike theorem;overtaking optimality