Mechanistic Interpretability explained | Chris Olah and Lex Fridman

Lex Fridman Podcast full episode: • Dario Amodei: Anthropic CEO on Claude, AGI... Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/cv824... See below for guest bio, links, and to give feedback, submit questions, contact Lex, etc. GUEST BIO: Dario Amodei is the CEO of Anthropic, the company that created Claude. Amanda Askell is an AI researcher working on Claude's character and personality. Chris Olah is an AI researcher working on mechanistic interpretability. CONTACT LEX: Feedback - give feedback to Lex: https://lexfridman.com/survey AMA - submit questions, videos or call-in: https://lexfridman.com/ama Hiring - join our team: https://lexfridman.com/hiring Other - other ways to get in touch: https://lexfridman.com/contact EPISODE LINKS: Claude: https://claude.ai Anthropic's X: https://x.com/AnthropicAI Anthropic's Website: https://anthropic.com Dario's X: https://x.com/DarioAmodei Dario's Website: https://darioamodei.com Machines of Loving Grace (Essay): https://darioamodei.com/machines-of-l... Chris's X: https://x.com/ch402 Chris's Blog: https://colah.github.io Amanda's X: https://x.com/AmandaAskell Amanda's Website: https://askell.io SPONSORS: To support this podcast, check out our sponsors & get discounts: Encord: AI tooling for annotation & data management. Go to https://lexfridman.com/s/encord-cv824... Notion: Note-taking and team collaboration. Go to https://lexfridman.com/s/notion-cv824... Shopify: Sell stuff online. Go to https://lexfridman.com/s/shopify-cv82... BetterHelp: Online therapy and counseling. Go to https://lexfridman.com/s/betterhelp-c... LMNT: Zero-sugar electrolyte drink mix. Go to https://lexfridman.com/s/lmnt-cv8247-sb PODCAST LINKS: Podcast Website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ Podcast Playlist: • Lex Fridman Podcast Clips Channel: / lexclips SOCIAL LINKS: X: https://x.com/lexfridman Instagram: / lexfridman TikTok: / lexfridman LinkedIn: / lexfridman Facebook: / lexfridman Patreon: / lexfridman Telegram: https://t.me/lexfridman Reddit: / lexfridman

Chris Olah - Looking Inside Neural Networks with Mechanistic Interpretability

Chris Olah - Looking Inside Neural Networks with Mechanistic Interpretability

Physicist explains the nature of time: It's a mind-blowing mystery | Don Lincoln and Lex Fridman

Physicist explains the nature of time: It's a mind-blowing mystery | Don Lincoln and Lex Fridman

Why a Top Law Firm Bought Its Own AI Company

Why a Top Law Firm Bought Its Own AI Company

Unsolved problems in AI | Chris Olah and Lex Fridman

Unsolved problems in AI | Chris Olah and Lex Fridman

Controversial theory about Göbekli Tepe | Irving Finkel and Lex Fridman

Controversial theory about Göbekli Tepe | Irving Finkel and Lex Fridman

Demis Hassabis: Agents, AGI & The Next Big Scientific Breakthrough

Demis Hassabis: Agents, AGI & The Next Big Scientific Breakthrough

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

Mathematician explains Gödel's Incompleteness Theorem | Edward Frenkel and Lex Fridman

Mathematician explains Gödel's Incompleteness Theorem | Edward Frenkel and Lex Fridman

Do LLMs Understand? AI Pioneer Yann LeCun Spars with DeepMind’s Adam Brown.

Do LLMs Understand? AI Pioneer Yann LeCun Spars with DeepMind’s Adam Brown.

Mechanistic Interpretability for NLP: One-stop Guide for Everything you Need to Know

Mechanistic Interpretability for NLP: One-stop Guide for Everything you Need to Know

Interpretability: Understanding how AI models think

Interpretability: Understanding how AI models think

Dark matter explained by physicist | Don Lincoln and Lex Fridman

Dark matter explained by physicist | Don Lincoln and Lex Fridman

Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega

Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega

Meet the Former CIA Agent Who Wants to Abolish the CIA

Meet the Former CIA Agent Who Wants to Abolish the CIA

Building Anthropic | A conversation with our co-founders

Building Anthropic | A conversation with our co-founders

We think this pattern continues forever, but can't prove it

We think this pattern continues forever, but can't prove it

The Hardest Questions in Physics | World Science Festival

The Hardest Questions in Physics | World Science Festival

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

Christopher Olah: Anthropic’s Core Views on AI SafetyChristopher Olah

Christopher Olah: Anthropic’s Core Views on AI SafetyChristopher Olah