Advantage actor-critic


Reviews

4.1 (285 ratings)
  • 5 stars: 153 ratings
  • 4 stars: 73 ratings
  • 3 stars: 27 ratings
  • 2 stars: 12 ratings
  • 1 star: 20 ratings
LJ

Oct 07, 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

VO

Mar 17, 2019

Well prepared and well taught course. Will highly recommend as the primer for reinforcement learning.

From the lesson
Policy-based methods
We spent the previous three modules working on value-based methods: learning state values, action values, and so on. Now it's time to see an alternative approach that doesn't require you to predict all future rewards to learn something.

Taught by

  • Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Lecturer
