Arjun Guha: How Language Models Model Programming Languages & How Programmers Model Language Models

In this Tech Talk, Arjun Guha takes a deeper look at how LLMs internally reason about programming concepts, and how programmers reason about the programming abilities of LLMs.

We first examine how language models perform at the task of predicting type annotations. Although language models can reason about complex type structures, they also clearly rely on lexical heuristics such as "n is an int" and "s is a string". Can we disentangle genuine reasoning from heuristics at all? Towards this goal, we present evidence that models do in fact learn a robust internal mechanism for type prediction in their hidden layers. We show that this mechanism often fails to activate on out-of-distribution programs, but when we force it to turn on, we can correct many type prediction failures. We also show that this mechanism is shared across programming languages.

We then examine how people perform at the task of steering language models on programming tasks. Many programmers, and new programmers in particular, have skill issues when it comes to writing prompts. But what does it mean to be good at writing programming prompts? When we interview students struggling to prompt models and instructors trying to teach prompting, a common refrain is that new programmers lack the technical vocabulary needed to effectively describe programming tasks. With a causal intervention experiment on a dataset of prompts, we show that this is not the case. Instead, we build an abstraction to study prompt editing at scale. Our work shows that prompts go wrong when they lack details that are essential for the task at hand, and it has implications for how language models fit into computing education.