Bellman Optimality Equations, Planning in MDPs Leave a Comment / Reinforcement Learning Theory / By Lixinjack
Bellman consistency equation; MRPs; Optimal value function Leave a Comment / Reinforcement Learning Theory / By Lixinjack