Mechanistic Interpretability explained | Chris Olah and Lex Fridman

Lex Fridman Podcast full episode: Dario Amodei: Anthropic CEO on Claude, AGI...
Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/cv824...
See below for guest bio, links, and to give feedback, submit questions, contact Lex, etc.

GUEST BIO:
Dario Amodei is the CEO of Anthropic, the company that created Claude.
Amanda Askell is an AI researcher working on Claude's character and personality.
Chris Olah is an AI researcher working on mechanistic interpretability.

CONTACT LEX:
Feedback - give feedback to Lex: https://lexfridman.com/survey
AMA - submit questions, videos or call-in: https://lexfridman.com/ama
Hiring - join our team: https://lexfridman.com/hiring
Other - other ways to get in touch: https://lexfridman.com/contact

EPISODE LINKS:
Claude: https://claude.ai
Anthropic's X: https://x.com/AnthropicAI
Anthropic's Website: https://anthropic.com
Dario's X: https://x.com/DarioAmodei
Dario's Website: https://darioamodei.com
Machines of Loving Grace (Essay): https://darioamodei.com/machines-of-l...
Chris's X: https://x.com/ch402
Chris's Blog: https://colah.github.io
Amanda's X: https://x.com/AmandaAskell
Amanda's Website: https://askell.io

SPONSORS:
To support this podcast, check out our sponsors & get discounts:
Encord: AI tooling for annotation & data management. Go to https://lexfridman.com/s/encord-cv824...
Notion: Note-taking and team collaboration. Go to https://lexfridman.com/s/notion-cv824...
Shopify: Sell stuff online. Go to https://lexfridman.com/s/shopify-cv82...
BetterHelp: Online therapy and counseling. Go to https://lexfridman.com/s/betterhelp-c...
LMNT: Zero-sugar electrolyte drink mix. Go to https://lexfridman.com/s/lmnt-cv8247-sb

PODCAST LINKS:
Podcast Website: https://lexfridman.com/podcast
Apple Podcasts: https://apple.co/2lwqZIr
Spotify: https://spoti.fi/2nEwCF8
RSS: https://lexfridman.com/feed/podcast/
Podcast Playlist: Lex Fridman Podcast
Clips Channel: / lexclips

SOCIAL LINKS:
X: https://x.com/lexfridman
Instagram: / lexfridman
TikTok: / lexfridman
LinkedIn: / lexfridman
Facebook: / lexfridman
Patreon: / lexfridman
Telegram: https://t.me/lexfridman
Reddit: / lexfridman

Chris Olah - Looking Inside Neural Networks with Mechanistic Interpretability

Physicist explains the nature of time: It's a mind-blowing mystery | Don Lincoln and Lex Fridman

Why a Top Law Firm Bought Its Own AI Company

Unsolved problems in AI | Chris Olah and Lex Fridman

Controversial theory about Göbekli Tepe | Irving Finkel and Lex Fridman

Demis Hassabis: Agents, AGI & The Next Big Scientific Breakthrough

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

Mathematician explains Gödel's Incompleteness Theorem | Edward Frenkel and Lex Fridman

Do LLMs Understand? AI Pioneer Yann LeCun Spars with DeepMind’s Adam Brown.

Mechanistic Interpretability for NLP: One-stop Guide for Everything you Need to Know

Interpretability: Understanding how AI models think

Dark matter explained by physicist | Don Lincoln and Lex Fridman

Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega

Meet the Former CIA Agent Who Wants to Abolish the CIA

Building Anthropic | A conversation with our co-founders

We think this pattern continues forever, but can't prove it

The Hardest Questions in Physics | World Science Festival

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

What Matters Right Now In Mechanistic Interpretability?

Christopher Olah: Anthropic’s Core Views on AI Safety