Search results: 2 items
No. | Article |
---|---|
1 | Stochastic approximation for nonexpansive maps: Application to Q-learning algorithms. Abounadi J, Bertsekas DP, Borkar V. SIAM Journal on Control and Optimization, 41(1), 1, 2002 |
2 | Learning algorithms for Markov decision processes with average cost. Abounadi J, Bertsekas D, Borkar VS. SIAM Journal on Control and Optimization, 40(3), 681, 2001 |