The State of AI 2025: Insights from Nathan Benaich at Air Street Capital #stateofai #ai

The State of AI Report is the most widely read and trusted analysis of key developments in AI. Published annually since 2018, the open-access report aims to spark informed conversation about the state of AI and what it means for the future. Produced by AI investor Nathan Benaich and Air Street Capital. In short, it’s been a monumental 12 months for AI. Our eighth annual report is the most comprehensive it’s ever been, covering what you need to know about research, industry, politics, and safety - along with our first State of AI Usage Survey of 1,200 practitioners. The State of AI Report has become the most widely read and trusted report on AI progress and is our open-access contribution to the AI ecosystem. In true State of AI Report style, let’s dive into the Research section. 12 months on, OpenAI still leads, but the pack has closed in fast. China’s DeepSeek, Qwen, and Kimi sit within a few points on reasoning and coding. While the US holds the frontier, China is now a credible and popular #2. Once a “Llama rip-off,” Qwen now powers 40% of all new fine-tunes on Hugging Face. China’s open-weights ecosystem has overtaken Meta’s, with Llama riding off into the sunset…for now. Reinforcement learning has grown up. After fuzzy human feedback came rubric-based rewards and verifiable reasoning tasks. We’re rediscovering rigor and environments for agents to undertake long-running tasks is all the rage. What’s this approach enabling? OpenAI and Gemini models both hit math Olympiad gold. Open provers like Gödel-LM are publishing formal proofs, showing that AI-assisted theorem proving is no longer science fiction. But we’re not just creating superintelligent agents to crush humans. Indeed, AlphaZero-discovered strategies improved the gameplay of four chess Grandmasters, proving that superhuman systems can teach the very best humans, not just beat them. AI is now a lab partner too. DeepMind’s Co-Scientist and Stanford’s Virtual Lab generate, debate, and validate hypotheses, discover new and established ideas as science is becoming a closed loop with AI in it. Biology gets its scaling laws too. Profluent’s ProGen3 trained on 1.5T tokens and created a compute frontier for protein language models. This is unlocking generalisation in novel protein space and a path to novel therapeutics such as custom gene editors. Robots now reason too. “Chain-of-Action” planning brings structured thought to the physical world - from AI2’s Molmo-Act to Gemini Robotics. Massive amounts of effort are thrown into the mix, expect lots of progress here… Anthropic’s Model Context Protocol is the new USB-C of AI. A single standard to connect models to tools, already embedded in ChatGPT, Gemini, Claude, and VS Code, has taken shape. But not without emerging security risks… Now, onto the Industry section. RIP AGI, long live Superintelligence. AI pilled tech executives have rebranded the mission, and it’s working: provocative, undefined, and exciting. The frontier fight is relentless. OpenAI still tops most leaderboards, but DeepMind’s stays there longer. Timing releases have become its own science…not least informing financing rounds like clockwork. Capability per dollar is doubling every few months. Google’s rate: 3.4 months. OpenAI’s: 5.8 months. More predictable gains are driving more investment, and more intelligence for less money. AI software adoption has gone mainstream. Ramp data shows 44% of US businesses now pay for AI, up from 5% in 2023. Average contract value for AI products hit $530k in 2025 and is expected to pass $1M in 2026. 12 month retention is now 80%+. AI-first companies still outrun everyone else too, growing 1.5x faster than peers on Standard Metrics. DeepSeek’s “$5M training run” deep freak was overblown. Since the market realised the fine print in the R1 paper, that’s led to Jevons paradox on steroids: lower cost per run, more runs, more compute needed, buy more NVIDIA. Enter Stargate: a $500B, 10GW US mega-cluster (4M chips) backed by Altman, Masa, Ellison, and Trump. The industrial era of AI begins. What a time to be alive. Sovereigns join the race: from China’s $5B Big Fund to the UAE’s MGX, nations are writing cheques to stay in the game. We expect some nations to just tap out and declare neutrality. China leads in power infrastructure too, adding over 400GW in 2024 vs 41GW for the US. Compute now clearly runs on geopolitics. The AI Safety Institute network has collapsed. Washington ditched attending meetings altogether, while the US and UK rebranded “safety” into “security.” Europe’s AI Act is wobbling: only 3 states are compliant, leaders calling it “confusing,” and pressure mounting for a pause as it’s clear the continent is being left behind. China’s spending through the debt as Xi told ministers to “redouble efforts” on AI, boosting science funding 10% despite record debt.

The Minds of Modern AI: Jensen Huang, Geoffrey Hinton, Yann LeCun & the AI Vision of the Future

The Minds of Modern AI: Jensen Huang, Geoffrey Hinton, Yann LeCun & the AI Vision of the Future

Andrej Karpathy: Software Is Changing (Again)

Andrej Karpathy: Software Is Changing (Again)

The Limits of AI: Generative AI, NLP, AGI, & What’s Next?

The Limits of AI: Generative AI, NLP, AGI, & What’s Next?

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

The State of AI 2024: Insights from Nathan Benaich at Air Street Capital #stateofai #ai

The State of AI 2024: Insights from Nathan Benaich at Air Street Capital #stateofai #ai

The arrival of AGI | Shane Legg (co-founder of DeepMind)

The arrival of AGI | Shane Legg (co-founder of DeepMind)

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Will AI outsmart human intelligence? - with 'Godfather of AI' Geoffrey Hinton

Will AI outsmart human intelligence? - with 'Godfather of AI' Geoffrey Hinton

FULL DISCUSSION: Google's Demis Hassabis, Anthropic's Dario Amodei Debate the World After AGI | AI1G

FULL DISCUSSION: Google's Demis Hassabis, Anthropic's Dario Amodei Debate the World After AGI | AI1G

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Is AI’s Circular Financing Inflating a Bubble?

Is AI’s Circular Financing Inflating a Bubble?

Andrew Ng: State of AI Agents | LangChain Interrupt

Andrew Ng: State of AI Agents | LangChain Interrupt

Greg Brockman: AGI, Sora 2, Bottlenecks, White Collar, Proactive AI, and more!

Greg Brockman: AGI, Sora 2, Bottlenecks, White Collar, Proactive AI, and more!

Marc Andreessen and Ben Horowitz on the State of AI

Marc Andreessen and Ben Horowitz on the State of AI

Model Context Protocol (MCP) Explained for Beginners: AI Flight Booking Demo!

Model Context Protocol (MCP) Explained for Beginners: AI Flight Booking Demo!

Andrew Ng: Building Faster with AI

Andrew Ng: Building Faster with AI

Inside Google DeepMind: AGI, Robotics, & World Models Explained - Demis Hassabis

Inside Google DeepMind: AGI, Robotics, & World Models Explained - Demis Hassabis

The Biggest Pivot In AI History Is Happening Right Now

The Biggest Pivot In AI History Is Happening Right Now

OpenClaw Creator: Why 80% Of Apps Will Disappear

OpenClaw Creator: Why 80% Of Apps Will Disappear

'Musk Will Get Richer, People Will Get Unemployed': Nobel Laureate Hinton on AI

'Musk Will Get Richer, People Will Get Unemployed': Nobel Laureate Hinton on AI