AlphaZero Explained: How it Learns [Convolutional Neural Network]

How does AlphaZero use neural networks to play games? How does it learn strategies from self play? I explain the details using an interactive Connect 4 AI that I built in the Python package Marimo. Link to interactive notebook for the CNN explorer: https://molab.marimo.io/notebooks/nb_... Link to Previous 2 videos in my AlphaZero explained series: Link to Part 1 where the UCB Algorithm is explained in more detail: • AlphaZero Explained 1: How it Solves "Expl... and Marimo UCB notebook https://molab.marimo.io/notebooks/nb_... Part 2 where we build up trees to look several moves ahead [Monte Carlo Tree Search] • AlphaZero Explained 2: How it Looks Into t... and Marimo MCTS notebook https://molab.marimo.io/notebooks/nb_... Chapters: 0:00 Introduction to AlphaZero and Connect 4 0:47 How AlphaZero uses neural networks 2:09 The training process: Reinforcement Learning vs. Supervised Learning 4:28 The bootstrapped training loop 5:51 Monte Carlo Tree Search (MCTS) and the expert function 8:33 The PUCT (Predictor/Polynomial Upper Confidence Tree) formula 9:45 Interactive Python/Marimo demo: MCTS vs. Neural Network 12:39 Training results and performance metrics 15:44 Visualizing the Convolutional Neural Network (CNN) 17:13 Feature planes and 3D data representation 20:13 How convolutions work: Parameters and filters 24:04 Interpreting weights and neuron activations 28:18 ReLU Non-linearity explained 30:08 Rollout upgrades for world-class performance

AlphaZero Explained 1: How it Solves "Explore vs Exploit" [UCB Algorithm]

AlphaZero Explained 1: How it Solves "Explore vs Exploit" [UCB Algorithm]

Why AI Tokens are so Expensive - Computerphile

Why AI Tokens are so Expensive - Computerphile

Yann LeCun's $1B Bet Against LLMs [Part 1]

Yann LeCun's $1B Bet Against LLMs [Part 1]

The best "Guess Who?" strategy (and how I proved it)

The best "Guess Who?" strategy (and how I proved it)

Hypercube Bridge Rectifiers and the Path of Least Resistance: What keeps me up at night.

Hypercube Bridge Rectifiers and the Path of Least Resistance: What keeps me up at night.

Android 17 sucks. So I put Linux on a phone.

Android 17 sucks. So I put Linux on a phone.

Violence Expert: Real Self-Defense Is TERRIFYING

Violence Expert: Real Self-Defense Is TERRIFYING

The insane engineering of Deepseek V4

The insane engineering of Deepseek V4

They LAUGHED at this White Rapper...then he started Rapping | Chris Turner's Freestyle Raps

They LAUGHED at this White Rapper...then he started Rapping | Chris Turner's Freestyle Raps

But what is quantum computing? (Grover's Algorithm)

But what is quantum computing? (Grover's Algorithm)

How the Super Soaker Inventor Just Killed the Steam Engine

How the Super Soaker Inventor Just Killed the Steam Engine

The Real Reason European Cars Can't Compete

The Real Reason European Cars Can't Compete

AlphaZero Explained 2: How it Looks Into the Future [Monte Carlo Tree Search]

AlphaZero Explained 2: How it Looks Into the Future [Monte Carlo Tree Search]

Yann LeCun: World Models: Enabling the next AI revolution

Yann LeCun: World Models: Enabling the next AI revolution

The Strange Math That Predicts (Almost) Anything

The Strange Math That Predicts (Almost) Anything

God Says:"MY CHILD, I NEED TO SEE YOU URGENTLY!"/God Message Now/God Message

God Says:"MY CHILD, I NEED TO SEE YOU URGENTLY!"/God Message Now/God Message

Something is jamming GPS over Europe. Here's what we found

Something is jamming GPS over Europe. Here's what we found

AI Bubble: The data center oversupply crisis is coming | Ed Zitron

AI Bubble: The data center oversupply crisis is coming | Ed Zitron

LLM that loops instead of Doing Chain-of-Thought

LLM that loops instead of Doing Chain-of-Thought

How The CIA Hacked Russia

How The CIA Hacked Russia