Reinforcement Learning: AlphaGo
How AlphaGo works, based on Reinforcement Learning. Part 2 of RL from scratch series. • Reinforcement Learning from scratch 0:00 - intro 0:06 - how to play Go 0:21 - introducing alphaGo 0:46 - analyzing expert games 2:17 - training an expert policy 2:47 - value functions 4:05 - search trees 5:42 - reinforcement learning 6:17 - alphaGo's value function 7:47 - alphaZero

▶︎
Reinforcement Learning: ChatGPT and RLHF

▶︎
Reinforcement Learning from scratch

▶︎
Reinforcement Learning: A (practical) introduction

▶︎
Reinforcement Learning: Essential Concepts

▶︎
AlphaGo - The Movie | Full award-winning documentary

▶︎
AI Revolution in Go: Corners & Sente

▶︎
Lee Sedol vs. AlphaGo: What Really Happened in the Match

▶︎
A visual guide on Reinforcement Learning - the 6 things that makes it “click”

▶︎
Deepmind AlphaZero - Mastering Games Without Human Knowledge

▶︎
AlphaZero: Shedding new light on the grand games of chess, shogi and Go

▶︎
How AI Learned to Think

▶︎
AlphaZero: An Introduction

▶︎
Reinforcement Learning, by the Book

▶︎
Terence Tao: Nobody Understands Why AI Actually Works

▶︎
We let AI buy a robot and a car, it does exactly what experts warned.

▶︎
The Story of AlphaGo: How AI Conquered the World's Hardest Game

▶︎
He Once Worked at Subway. At 58, He Solved An "Impossible" Problem

▶︎
The FASTEST introduction to Reinforcement Learning on the internet

▶︎
Anthropic is Completely F*cked.

▶︎
