logo
Loading...

How to Beat Pong Using Policy Gradients (LIVE) - Siraj Raval - 深度學習 Deep Learning 公開課 - Cupoy

We're going to use the policy gradient technique from reinforcement learning to beat the game of Pon...

We're going to use the policy gradient technique from reinforcement learning to beat the game of Pong. We'll use OpenAI's Universe as an environment for our agent and I'll go over the process of setting it up as well as the math behind the PG method in detail. Microphone popping issues end at 11:15 . That cannot happen again. Udacity is aware of this and will be more prepared next time. Code for this video: https://github.com/llSourcell/Policy_... Join us in the Wizards Slack channel: http://wizards.herokuapp.com/ More Learning resources: http://www.scholarpedia.org/article/P... http://proceedings.mlr.press/v32/silv... http://karpathy.github.io/2016/05/31/rl/ http://home.deib.polimi.it/restelli/M... http://www0.cs.ucl.ac.uk/staff/D.Silv... https://github.com/dennybritz/reinfor...