Exploration in Reinforcement Learning: Bandits, UCB & Thompson Sampling

When should an agent try something new instead of cashing in what it already knows? A silent, animated explainer on exploration in reinforcement learning. Covered: • The explore/exploit dilemma • Multi-armed and contextual bandits as the simplest RL • epsilon-greedy • Optimism in the face of uncertainty: UCB • Thompson sampling (posterior sampling) • Curiosity and intrinsic rewards: count-based, prediction error, RND, empowerment, information gain Built with Manim. No narration or music; everything is explained on screen.

Euler's Identity: e^(iπ) + 1 = 0, and the Genius Behind It

Euler's Identity: e^(iπ) + 1 = 0, and the Genius Behind It

The Nuclear Pore Complex: How an Open Hole Is a Selective Gate

The Nuclear Pore Complex: How an Open Hole Is a Selective Gate

I 100%'d the Backyard Nuclear Bomb Building Game

I 100%'d the Backyard Nuclear Bomb Building Game

Hans Zimmer - Interstellar (Space Sounds)

Hans Zimmer - Interstellar (Space Sounds)

The FASTEST introduction to Reinforcement Learning on the internet

The FASTEST introduction to Reinforcement Learning on the internet

🔴 Makkah Live | مكة مباشر | الحرم المكي مباشر | قناة القران الكريم السعودية مباشر | مكه المكرمه

🔴 Makkah Live | مكة مباشر | الحرم المكي مباشر | قناة القران الكريم السعودية مباشر | مكه المكرمه

If You Have A Bad Memory, I’ll Help You Fix It In 28 Minutes

If You Have A Bad Memory, I’ll Help You Fix It In 28 Minutes

System Design Course – APIs, Databases, Caching, CDNs, Load Balancing & Production Infra

System Design Course – APIs, Databases, Caching, CDNs, Load Balancing & Production Infra

PINK & ORANGE GRADIENT IN HD [3 HOURS]

PINK & ORANGE GRADIENT IN HD [3 HOURS]

Only Dangerously Smart People Think Like This

Only Dangerously Smart People Think Like This

Let's build GPT: from scratch, in code, spelled out.

Let's build GPT: from scratch, in code, spelled out.

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

He Once Worked at Subway. At 58, He Solved An "Impossible" Problem

He Once Worked at Subway. At 58, He Solved An "Impossible" Problem

Belgien – Iran Highlights | Gruppe G, FIFA WM 2026 | sportstudio

Belgien – Iran Highlights | Gruppe G, FIFA WM 2026 | sportstudio

Beautiful Relaxing Music - Stop Overthinking, Stress Relief Music, Sleep Music, Calming Music #177

Beautiful Relaxing Music - Stop Overthinking, Stress Relief Music, Sleep Music, Calming Music #177

The Most Misunderstood Concept in Physics

The Most Misunderstood Concept in Physics

Overexplaining the binomial distribution

Overexplaining the binomial distribution

Morphogenetic Fields & Bioelectricity: where the body's blueprint hides

Morphogenetic Fields & Bioelectricity: where the body's blueprint hides

How To Become Dangerously Self-Educated (with AI)

How To Become Dangerously Self-Educated (with AI)

AlphaFold - The Most Useful Thing AI Has Ever Done

AlphaFold - The Most Useful Thing AI Has Ever Done