|
![blank](/z.gif) |
Результат поиска |
Поиск книг, содержащих: Policy iteration
Книга | Страницы для поиска | Dietterich T.G., Becker S., Ghahramani Z. — Advances in neural information processing systems 14 (Vol. 1 and vol. 2) | 1515, 1531, 1547, 1579 | Kumar P.R., Varaiya P. — Stochastic Systems: Estimation, Identification, and Adaptive Control | 153, 162 | Shanbhag D.N. (ed.), Rao C.R. (ed.) — Stochastic Processes - Modelling and Simulation | 14 | Bertsekas D.P. — Dynamic programming and optimal control (Vol. 1) | 303, 308, 321 | Sutton R.S., Barto A.G. — Reinforcement Learning | 97—100, see also "Generalized policy iteration" | Bertsekas D.P. — Dynamic programming and optimal control (Vol. 2) | 35, 71, 73, 91, 149, 180, 213, 223 | Powell W.B. — Approximate dynamic programming: Solving the curses of dimensionality | 62, 282 | BertsekasD., Tsitsiklis J. — Neuro-Dynamic Programming (Optimization and Neural Computation Series, 3) | 29, 41 |
|
|