Deep RL Bootcamp Lecture 6: Nuts and Bolts of Deep RL Experimentation
Instructor: John Schulman (OpenAI) Lecture 6 Deep RL Bootcamp Berkeley August 2017 Nuts and Bolts of Deep RL Experimentation

▶︎
Deep RL Bootcamp Lecture 4A: Policy Gradients

▶︎
Deep RL Bootcamp Lecture 4B Policy Gradients Revisited

▶︎
Deep RL Bootcamp Lecture 7 SVG, DDPG, and Stochastic Computation Graphs (John Schulman)

▶︎
Deep RL Bootcamp Lecture 1: Motivation + Overview + Exact Solution Methods

▶︎
DeepMind's Richard Sutton - The Long-term of AI & Temporal-Difference Learning

▶︎
Model Based RL Finally Works!

▶︎
Wie schlägt sich unsere Regierung? Halbzeitanalyse mit Fabian Köster | heute-show

▶︎
The Strange Math That Predicts (Almost) Anything

▶︎
Die Zombie-Simulation, die niemand erklären kann

▶︎
Ilya Sutskever: OpenAI Meta-Learning and Self-Play | MIT Artificial General Intelligence (AGI)

▶︎
Oligarchy is worse than you think

▶︎
Proximal Policy Optimization (PPO) is Easy With PyTorch | Full PPO Tutorial

▶︎
Deep RL Bootcamp Lecture 3: Deep Q-Networks

▶︎
Training AI Without Writing A Reward Function, with Reward Modelling

▶︎
L3 Policy Gradients and Advantage Estimation (Foundations of Deep RL Series)

▶︎
Train Your Brain to Never Forget (5 Feynman Habits)

▶︎
Zig says NO to AI

▶︎
Stanford CS25: V2 I Introduction to Transformers w/ Andrej Karpathy

▶︎
Reinforcement Learning: Machine Learning Meets Control Theory

▶︎
