Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 6 - Reinforcement Learning Primer
For more information about Stanford’s Artificial Intelligence professional and graduate programs, visit: https://stanford.io/ai Assistant Professor Chelsea Finn, Stanford University http://cs330.stanford.edu/ 0:00 Introduction 0:46 Logistics 2:31 Why Reinforcement Learning? 3:37 The Plan 6:16 Terminology & notation 8:36 Imitation Learning 10:01 Reward functions 10:57 The goal of reinforcement learning 19:15 What is a reinforcement learning task? 21:01 The goal of multi-task reinforcement learning 23:31 The anatomy of a reinforcement learning algorithm 25:48 Evaluating the objective 26:43 Direct policy differentiation 32:02 Evaluating the policy gradient 33:16 Comparison to maximum likelihood 35:54 Example: MAML + policy gradient 37:25 Example: Black-box meta-learning + policy gradient 45:26 Policy Gradients 49:16 Value-Based RL: Definitions 52:14 Fitted Q-iteration Algorithm 56:13 Multi-Task RL Algorithms 58:00 An example

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 7 - Kate Rakelly (UC Berkeley)

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 5 - Bayesian Meta-Learning

The FASTEST introduction to Reinforcement Learning on the internet

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 1 - Introduction & Overview
![Yann LeCun's $1B Bet Against LLMs [Part 1]](https://i.ytimg.com/vi/kYkIdXwW2AE/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDbV4izF3i-wxevCVIn7FJjoy1vlA)
Yann LeCun's $1B Bet Against LLMs [Part 1]

Trump Preps for 80th Birthday, Threatens to Hit Iran, Knicks Historic Win & Elon Musk Trillionaire!?

Something is jamming GPS over Europe. Here's what we found
![Master No Code Chatbots With Copilot Studio (Formerly Power Virtual Agents) [Full Course]](https://i.ytimg.com/vi/nYxf8ndIBE0/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLCDSuC2zfv72qnTbKu4dkMBDhkYUg)
Master No Code Chatbots With Copilot Studio (Formerly Power Virtual Agents) [Full Course]

Causal Mechanistic Interpretability (Stanford lecture 1) - Atticus Geiger

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 2 - Multi-Task & Meta-Learning Basics

Deep RL Bootcamp Lecture 4A: Policy Gradients

How to Speak

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 9 - Lifelong Learning

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

Becoming an AI-Driven Leader: Harnessing AI to Make Faster, Smarter Decisions with Geoff Woods

Learning to learn: An Introduction to Meta Learning
![[AUTOML23] A Tutorial on MetaReinforcement Learning](https://i.ytimg.com/vi/XUQ9jLOZqGc/hqdefault.jpg?sqp=-oaymwE9CNACELwBSFryq4qpAy8IARUAAAAAGAElAADIQj0AgKJDeAHwAQH4Af4JgALQBYoCDAgAEAEYZSBlKGUwDw==&rs=AOn4CLAbTCtQu4DWxm7D_0zPxExBDiEAGg)
[AUTOML23] A Tutorial on MetaReinforcement Learning

Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 3 - Optimization-Based Meta-Learning

Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 1 - Introduction - Emma Brunskill

