Chemical Engineering and Materials Research Information Center (CHERIC)
Automatica, Vol.67, 77-84, 2016
A unified approach to time-aggregated Markov decision processes
This paper presents a unified approach to time-aggregated Markov decision processes (MDPs) with an average cost criterion. The approach is based on a framework in which a time-aggregated MDP constitutes a semi-Markov decision process (SMDP). By analyzing the performance sensitivity formulas of this SMDP, a number of optimization algorithms for time-aggregated MDPs, including those previously reported in the literature, can be developed in a simple and intuitive way. © 2016 Elsevier Ltd. All rights reserved.
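The SMDP view described in the abstract can be illustrated with a small numerical sketch. This is not the paper's algorithm. It only shows, for a fixed policy, how observing a Markov chain at visits to a chosen subset of states yields an embedded chain with per-segment costs and lengths, and that the ratio of expected segment cost to expected segment length recovers the original average cost. All names (`stationary`, `time_aggregate`, `average_cost_smdp`) and the example matrix are hypothetical, and the chain is assumed ergodic so that the matrix inverse exists.

```python
import numpy as np

def stationary(P):
    """Stationary distribution of an ergodic transition matrix P."""
    n = P.shape[0]
    # Solve pi (P - I) = 0 together with sum(pi) = 1 in the least-squares sense.
    A = np.vstack([P.T - np.eye(n), np.ones((1, n))])
    b = np.zeros(n + 1)
    b[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

def time_aggregate(P, c, agg):
    """Embedded chain on the states in `agg`, with expected cost and
    expected length of the segment between consecutive visits to `agg`."""
    n = P.shape[0]
    agg = np.asarray(agg)
    rest = np.setdiff1d(np.arange(n), agg)
    P11 = P[np.ix_(agg, agg)]
    P12 = P[np.ix_(agg, rest)]
    P21 = P[np.ix_(rest, agg)]
    P22 = P[np.ix_(rest, rest)]
    # Expected number of visits to each non-aggregated state before returning.
    N = np.linalg.inv(np.eye(len(rest)) - P22)
    P_emb = P11 + P12 @ N @ P21
    seg_cost = c[agg] + P12 @ N @ c[rest]
    seg_len = 1.0 + P12 @ N @ np.ones(len(rest))
    return P_emb, seg_cost, seg_len

def average_cost_smdp(P_emb, seg_cost, seg_len):
    """Average cost per unit time as a ratio of segment expectations."""
    pi = stationary(P_emb)
    return (pi @ seg_cost) / (pi @ seg_len)

# Hypothetical 4-state chain under a fixed policy, aggregated on states 0 and 1.
P = np.array([[0.50, 0.20, 0.20, 0.10],
              [0.10, 0.60, 0.20, 0.10],
              [0.30, 0.30, 0.20, 0.20],
              [0.25, 0.25, 0.25, 0.25]])
c = np.array([1.0, 2.0, 4.0, 0.5])
P_emb, seg_cost, seg_len = time_aggregate(P, c, [0, 1])
print(average_cost_smdp(P_emb, seg_cost, seg_len), stationary(P) @ c)
```

The two printed values should agree up to floating-point error, which is the sense in which the time-aggregated process preserves the average cost criterion in this sketch.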