State and Action Value Functions

Loading...
来自 国立高等经济大学 的课程
Practical Reinforcement Learning
5 评分
国立高等经济大学
5 评分
课程 4(共 7 门,Specialization Advanced Machine Learning
从本节课中
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

与讲师见面

  • Pavel Shvechikov
    Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab
    HSE Faculty of Computer Science
  • Alexander Panin
    Alexander Panin
    Lecturer
    HSE Faculty of Computer Science

探索我们的目录

免费加入并获得个性化推荐、更新和优惠。