Gradient-Based Myopic Allocation Policy: An Efficient Sampling Procedure in a Low-Confidence Scenario

Peng YJ; Chen CH; Fu MC; Hu JQ

IEEE Transactions on Automatic Control, Vol.63, No.9, 3091-3097, 2018

DOI10.1109/TAC.2017.2776606 Export Citation

Gradient-Based Myopic Allocation Policy: An Efficient Sampling Procedure in a Low-Confidence Scenario

In this note, we study a simulation optimization problem of selecting the alternative with the best performance from a finite set, or a so-called ranking and selection problem, in a special low-confidence scenario. The most popular sampling allocation procedures in ranking and selection do not perform well in this scenario, because they all ignore certain induced correlations that significantly affect the probability of correct selection in this scenario. We propose a gradient-based myopic allocation policy that takes the induced correlations into account, reflecting a tradeoff between the induced correlation and the two factors (mean-variance) found in the optimal computing budget allocation formula. Numerical experiments substantiate the efficiency of the new procedure in the low-confidence scenario.

Keywords:Bayesian framework;myopic allocation policy (MAP);ranking and selection;simulation