AI Engineering Workflows, Benchmarks, and the Future of Coding Agents | swyx

AI engineering workflows are evolving fast. swyx (AI.Engineer) breaks down agentic code benchmarks, why 50% of SweBench code is unmergeable, how agents are breaking GitHub, and why all the AI labs are IPO-ing at the same time. Chapters below. Subscribe for more AI:AM conversations. 0:00 AI insiders are selling. 0:27 $300K vs $5M for same job. 0:49 OpenAI is the Apple of AI. 1:37 Intro: Who is swyx 4:04 AI Engineer World's Fair themes 7:24 Continual learning: Weights vs systems 10:51 Enterprise AI: Cheap, perfect, private 14:45 Startups vs enterprises: Capability vs cost 17:38 FrontierCode: A new AI coding benchmark 23:15 Preventing benchmark saturation 25:43 Slop code, human taste, and Move 37 30:13 Claude Opus vs Fable: Cost vs capability 32:05 The advisor model and model routing 36:28 Convergence and market segments in AI 44:15 Rebuilding cloud infrastructure for agents 51:46 Vibe coding internal SaaS replacements 57:22 Whoever owns the system of record wins 59:55 The AI IPO bubble and insider selling 1:04:49 Solving Star Trek problems after the IPO 1:14:07 Career advice for CS grads in the AI era 1:19:50 AI Engineer World's Fair 2026 Full episode: • swyx: How AI Agents Are Rewriting Software Guest links: swyx: https://x.com/swyx | / shawnswyxwang AI:AM links: 𝕏 AI:AM: https://x.com/ai_in_the_am 𝕏 Prakash: https://x.com/8teapi 𝕏 Nathan: https://x.com/labenz Website: https://ai-in-the-am.com #AIEngineering #CodingAgents #AIBenchmarks

CLAUDE CODE MASTERCLASS 4 HOURS: Build & Sell (2026)

CLAUDE CODE MASTERCLASS 4 HOURS: Build & Sell (2026)

L'Agentic Coding, nouveau territoire du Platform Engineering

L'Agentic Coding, nouveau territoire du Platform Engineering

How to 10x Your Value in the A.I. Era | ft. Kunal Shah

How to 10x Your Value in the A.I. Era | ft. Kunal Shah

Software architecture, human judgment, and AI's limits with Grady Booch

Software architecture, human judgment, and AI's limits with Grady Booch

DeepMind Chief Demis Hassabis Says Google’s Still Winning AI Talent | Semafor Tech

DeepMind Chief Demis Hassabis Says Google’s Still Winning AI Talent | Semafor Tech

swyx: How AI Agents Are Rewriting Software

swyx: How AI Agents Are Rewriting Software

Skill Issue: Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

Skill Issue: Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

The Moment That Changed Software Development!

The Moment That Changed Software Development!

ASP.NET Community Standup: Better AI for .NET developers with dotnet/skills

ASP.NET Community Standup: Better AI for .NET developers with dotnet/skills

The Future of AI Agents with Andrew Ng | Interrupt 26

The Future of AI Agents with Andrew Ng | Interrupt 26

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Japan – Schweden Highlights | Gruppe F, FIFA WM 2026 | sportstudio

Japan – Schweden Highlights | Gruppe F, FIFA WM 2026 | sportstudio

Power Automate Beginner to Pro Tutorial [Full Course]

Power Automate Beginner to Pro Tutorial [Full Course]

How Elite Software Engineers Are Using Agents to Get Sh*t Done

How Elite Software Engineers Are Using Agents to Get Sh*t Done

Europe's energy scam is worse than you think - Yanis Varoufakis & Wolfgang Munchau | The Econoclasts

Europe's energy scam is worse than you think - Yanis Varoufakis & Wolfgang Munchau | The Econoclasts

John Groetzinger - Skills Everywhere: AI Native DevCon London 2026

John Groetzinger - Skills Everywhere: AI Native DevCon London 2026

Stop Prompting Claude. Use Karpathy's Method Instead.

Stop Prompting Claude. Use Karpathy's Method Instead.

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

Mitchell Hashimoto’s new way of writing code

Mitchell Hashimoto’s new way of writing code

Google DeepMind Distinguished Eng (L9): How To Land a Job at a Frontier Lab | Vlad Feinberg

Google DeepMind Distinguished Eng (L9): How To Land a Job at a Frontier Lab | Vlad Feinberg