A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning
A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such algorithms.
Specifications
| ISBN/EAN | 9781601987600 |
| Author | Alborz Geramifard |
| Publisher | Van Ditmar Boekenimport B.V. |
| Language | English |
| Format | Paperback / softcover |
| Pages | 92 |
| Length | |
| Width | |
