Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 6 - Reinforcement Learning Primer

For more information about Stanford’s Artificial Intelligence professional and graduate programs, visit: https://stanford.io/ai

Assistant Professor Chelsea Finn, Stanford University
http://cs330.stanford.edu/

0:00 Introduction
0:46 Logistics
2:31 Why Reinforcement Learning?
3:37 The Plan
6:16 Terminology & notation
8:36 Imitation Learning
10:01 Reward functions
10:57 The goal of reinforcement learning
19:15 What is a reinforcement learning task?
21:01 The goal of multi-task reinforcement learning
23:31 The anatomy of a reinforcement learning algorithm
25:48 Evaluating the objective
26:43 Direct policy differentiation
32:02 Evaluating the policy gradient
33:16 Comparison to maximum likelihood
35:54 Example: MAML + policy gradient
37:25 Example: Black-box meta-learning + policy gradient
45:26 Policy Gradients
49:16 Value-Based RL: Definitions
52:14 Fitted Q-iteration Algorithm
56:13 Multi-Task RL Algorithms
58:00 An example

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 7 - Kate Rakelly (UC Berkeley)

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 5 - Bayesian Meta-Learning

The FASTEST introduction to Reinforcement Learning on the internet

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 1 - Introduction & Overview

Yann LeCun's $1B Bet Against LLMs [Part 1]

Trump Preps for 80th Birthday, Threatens to Hit Iran, Knicks Historic Win & Elon Musk Trillionaire!?

Something is jamming GPS over Europe. Here's what we found

Master No Code Chatbots With Copilot Studio (Formerly Power Virtual Agents) [Full Course]

Causal Mechanistic Interpretability (Stanford lecture 1) - Atticus Geiger

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 2 - Multi-Task & Meta-Learning Basics

Deep RL Bootcamp Lecture 4A: Policy Gradients

How to Speak

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 9 - Lifelong Learning

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

Becoming an AI-Driven Leader: Harnessing AI to Make Faster, Smarter Decisions with Geoff Woods

Learning to learn: An Introduction to Meta Learning

[AUTOML23] A Tutorial on MetaReinforcement Learning

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 3 - Optimization-Based Meta-Learning

Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 1 - Introduction - Emma Brunskill

1: Introduction to Neural Networks and Deep Learning; Training Deep NNs
