Reward design

video-placeholder
Loading...
查看授课大纲

审阅

4.2(431 个评分)
  • 5 stars
    58.46%
  • 4 stars
    22.96%
  • 3 stars
    9.04%
  • 2 stars
    4.17%
  • 1 star
    5.33%
SF
Apr 8, 2020

At times it felt like a bit more video material would be helpful to better understand the subject/gain deeper understanding.\n\nAnd fixing some of the notebooks would be helpful.

FZ
Feb 13, 2019

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which makes you feel confusing.

从本节课中
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

教学方

  • Placeholder

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Placeholder

    Alexander Panin

    Lecturer

探索我们的目录

免费加入并获得个性化推荐、更新和优惠。