AlphaZero Explained: How it Learns [Convolutional Neural Network]
How does AlphaZero use neural networks to play games? How does it learn strategies from self play? I explain the details using an interactive Connect 4 AI that I built in the Python package Marimo. Link to interactive notebook for the CNN explorer: https://molab.marimo.io/notebooks/nb_... Link to Previous 2 videos in my AlphaZero explained series: Link to Part 1 where the UCB Algorithm is explained in more detail: • AlphaZero Explained 1: How it Solves "Expl... and Marimo UCB notebook https://molab.marimo.io/notebooks/nb_... Part 2 where we build up trees to look several moves ahead [Monte Carlo Tree Search] • AlphaZero Explained 2: How it Looks Into t... and Marimo MCTS notebook https://molab.marimo.io/notebooks/nb_... Chapters: 0:00 Introduction to AlphaZero and Connect 4 0:47 How AlphaZero uses neural networks 2:09 The training process: Reinforcement Learning vs. Supervised Learning 4:28 The bootstrapped training loop 5:51 Monte Carlo Tree Search (MCTS) and the expert function 8:33 The PUCT (Predictor/Polynomial Upper Confidence Tree) formula 9:45 Interactive Python/Marimo demo: MCTS vs. Neural Network 12:39 Training results and performance metrics 15:44 Visualizing the Convolutional Neural Network (CNN) 17:13 Feature planes and 3D data representation 20:13 How convolutions work: Parameters and filters 24:04 Interpreting weights and neuron activations 28:18 ReLU Non-linearity explained 30:08 Rollout upgrades for world-class performance
![AlphaZero Explained 1: How it Solves "Explore vs Exploit" [UCB Algorithm]](https://i.ytimg.com/vi/mYZlV44UzLc/hqdefault_custom_2.jpg?sqp=CITRqtIG-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLA2qhFsG4DMfXfAncwL4WbxNGqNNA)
AlphaZero Explained 1: How it Solves "Explore vs Exploit" [UCB Algorithm]

Why AI Tokens are so Expensive - Computerphile
![Yann LeCun's $1B Bet Against LLMs [Part 1]](https://i.ytimg.com/vi/kYkIdXwW2AE/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDbV4izF3i-wxevCVIn7FJjoy1vlA)
Yann LeCun's $1B Bet Against LLMs [Part 1]

The best "Guess Who?" strategy (and how I proved it)

Hypercube Bridge Rectifiers and the Path of Least Resistance: What keeps me up at night.

Android 17 sucks. So I put Linux on a phone.

Violence Expert: Real Self-Defense Is TERRIFYING

The insane engineering of Deepseek V4

They LAUGHED at this White Rapper...then he started Rapping | Chris Turner's Freestyle Raps

But what is quantum computing? (Grover's Algorithm)

How the Super Soaker Inventor Just Killed the Steam Engine

The Real Reason European Cars Can't Compete
![AlphaZero Explained 2: How it Looks Into the Future [Monte Carlo Tree Search]](https://i.ytimg.com/vi/uQUsAndESdQ/hqdefault_custom_1.jpg?sqp=CITRqtIG-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLCdPpfvR9P-qOdF7GoIrw5A_gwVig)
AlphaZero Explained 2: How it Looks Into the Future [Monte Carlo Tree Search]

Yann LeCun: World Models: Enabling the next AI revolution

The Strange Math That Predicts (Almost) Anything

God Says:"MY CHILD, I NEED TO SEE YOU URGENTLY!"/God Message Now/God Message

Something is jamming GPS over Europe. Here's what we found

AI Bubble: The data center oversupply crisis is coming | Ed Zitron

LLM that loops instead of Doing Chain-of-Thought

