Harnesses in AI: A Deep Dive — Tejas Kumar, IBM
The agent hit a login page, panicked, reported success anyway, and the upvote never happened. Tejas Kumar's diagnosis: not a prompt problem. A harness problem. The demo builds a browser agent on GPT-3.5 Turbo (consciously choosing a VERY old model to show how good harness eng can improve it a lot) against Hacker News and layers in a harness without touching the prompt once. Guardrails cap iterations and compact context. A verify step reads the tool call history to catch the agent lying about what it did. A login handler watches the browser URL each loop and injects credentials programmatically when it hits the login page. By the end the cheap old model reliably logs in and upvotes the post. Speaker info: https://x.com/TejasKumar_ / tejasq https://github.com/TejasQ Timestamps: 0:00 Introduction to Tejas Kumar and AI Harnesses 1:45 Why we use harnesses: Reliability and control 3:00 Defining an agent harness from first principles 4:32 Key components of an agent harness (Tooling, Context, Guardrails) 5:59 Starting the demo: Building a browser agent 7:00 Inspecting the initial agent loop 8:12 The problem: Agent failure and hallucination 10:20 Adding guardrails and context management 11:54 Refactoring into a formal harness 13:02 Implementing a verify step to catch lies 15:36 Implementing a login handler for programmatic access 17:42 Final demonstration: Successful autonomous upvoting 18:34 Summary and the future of dynamic harnesses

How I deleted 95% of my agent skills and got better results — Nick Nisi, WorkOS

The Multi-Agent Architecture That Actually Ships — Luke Alvoeiro, Factory

Five AI Risks That Can Get You Fired—And How to Avoid Them

What Is AI Harness Exactly?

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

What is an Agent Harness? and How to build a great one!

How We Build Effective Agents: Barry Zhang, Anthropic

Beyond Components: Designing Generative UI for MCP Apps — Ruben Casas, Postman

Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains

"Software Fundamentals Matter More Than Ever" — Matt Pocock

How Anthropic Engineers ACTUALLY Prompt Claude Code

Harness Engineering: What Separates Top Agentic Engineers Right Now

CLI vs MCP: How AI Agents Choose the Right Tool for the Job

A Conversation with Demis Hassabis, Co-Founder and CEO of Google DeepMind

Context Is the New Code — Patrick Debois, Tessl

Agent Harness explained in 8min..

No Vibes Allowed: Solving Hard Problems in Complex Codebases – Dex Horthy, HumanLayer

How To De-Slop A Codebase Ruined By AI (with one skill)

What AI Agent Skills Are and How They Work

