Tokenization Explained: The Hidden Step Behind Every LLM

0:00 Part 1 — Text Is Not Numbers: The First Step in Every LLM 4:46 Part 2 — Why Not Just Characters or Words? 10:42 Part 3 — Byte-Pair Encoding: Learning the Vocabulary by Merging 16:21 Part 4 — Bytes, Not Letters: How GPT Never Says UNK 22:44 Part 5 — 100,000 Tokens: Real Tokenizers and Their Vocabularies 28:34 Part 6 — Why ChatGPT Can't Count the R's in Strawberry 33:59 Part 7 — The Hidden Cost: Tokens, Money, and Language Fairness How does a language model actually read your words? Before any "AI" happens, your text is shredded into subword pieces called tokens — and that invisible step shapes what GPT can count, what it costs, and who gets a fair deal. This 40-minute deep dive covers the full story from first principles to real-world failure modes.

Most devs don't understand how LLM tokens work

Most devs don't understand how LLM tokens work

Every Free App You Actually Need Explained in 20 Minutes

Every Free App You Actually Need Explained in 20 Minutes

Transformers, the tech behind LLMs | Deep Learning Chapter 5

Transformers, the tech behind LLMs | Deep Learning Chapter 5

Yann LeCun's $1B Bet Against LLMs [Part 1]

Yann LeCun's $1B Bet Against LLMs [Part 1]

The Strange Math That Predicts (Almost) Anything

The Strange Math That Predicts (Almost) Anything

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Two of the Most Senior AI Engineers Just Said the Same 6 Words — Nobody Can Define Them

Two of the Most Senior AI Engineers Just Said the Same 6 Words — Nobody Can Define Them

Training Sand to Think: Artificial General Intelligence & Future of Physics

Training Sand to Think: Artificial General Intelligence & Future of Physics

But how do AI images and videos actually work? | Guest video by Welch Labs

But how do AI images and videos actually work? | Guest video by Welch Labs

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

The Uncomfortable Truth About AI “Reasoning” | World Science Festival

The Uncomfortable Truth About AI “Reasoning” | World Science Festival

Alien Life Explained: Are We Alone in a Dead Universe? | Sir David Attenborough

Alien Life Explained: Are We Alone in a Dead Universe? | Sir David Attenborough

Unfortunately, I Was Right

Unfortunately, I Was Right

But what is a neural network? | Deep learning chapter 1

But what is a neural network? | Deep learning chapter 1

Android 17 SUCKS. So I put Linux on a phone.

Android 17 SUCKS. So I put Linux on a phone.

This is not the AI we were promised | The Royal Society

This is not the AI we were promised | The Royal Society

Passkeys Explained: Are They Actually Better Than Passwords?

Passkeys Explained: Are They Actually Better Than Passwords?

I Gave ChatGPT a Body

I Gave ChatGPT a Body

Why DeepSeek V4 Has Everyone Freaking Out

Why DeepSeek V4 Has Everyone Freaking Out

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan