Advantage actor-critic

Reviews

4.1 (261 ratings)
  • 5 stars
    138 ratings
  • 4 stars
    66 ratings
  • 3 stars
    26 ratings
  • 2 stars
    11 ratings
  • 1 star
    20 ratings
VO

Mar 17, 2019

Well-prepared and well-taught course. Highly recommended as a primer for reinforcement learning.

JJ

Sep 15, 2019

Fantastic class if you don't mind overcoming some code issues in the homework.

From the lesson
Policy-based methods
We spent the previous three modules on value-based methods: learning state values, action values, and so on. Now it's time to look at an alternative approach that doesn't require you to predict all future rewards in order to learn something.

Taught by

  • Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Lecturer
