A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning
A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such algorithms.
Specificaties
ISBN/EAN | 9781601987600 |
Auteur | Alborz Geramifard |
Uitgever | Van Ditmar Boekenimport B.V. |
Taal | Engels |
Uitvoering | Paperback / gebrocheerd |
Pagina's | 92 |
Lengte | |
Breedte |