Case study: A3C

Reviews

4.2 (383 ratings)
  • 5 stars: 56.39%
  • 4 stars: 23.75%
  • 3 stars: 9.13%
  • 2 stars: 4.69%
  • 1 star: 6%

FZ

Feb 14, 2019

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which makes you feel confusing.

LJ

Oct 07, 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

From the lesson
Policy-based methods
We spent the previous three modules working on value-based methods: learning state values, action values, and so on. Now it is time to look at an alternative approach that doesn't require you to predict all future rewards in order to learn something.

Taught by

  • Pavel Shvechikov, Researcher at HSE and Sberbank AI Lab
  • Alexander Panin, Lecturer
