User Testing the User Tester - Latent Space AI in Action presentation

Your coding agent compiles, runs every test, and reads its own traces. Tests only check that you built the spec, never that the spec was right, and the one thing the agent can't do is be your user: someone who doesn't care about your mission, arrives with their own goal, and takes a path you never thought to test. Authoring that path is the hard part. A synthetic user just walks it. So I ran the experiment twice. First, head to head with practicing UX researchers: same product, same brief, graded on accuracy and usefulness, with the researchers on record about whether synthetic research is ready to trust. Then against reality: synthetic feedback running beside a product's real user reports, the kind aggregated from every forum, Discord, and support thread it has. Does it reproduce the issues those users already found, and can it surface the next ones before a single user has to? That answer is the talk.
