Everything I Learned Training Frontier Small Models — Maxime Labonne, Liquid AI

A new class of small models is emerging with the ability to reliably follow instructions and call tools while running on-device under 1 GB of memory. In this talk, we'll break down how to post-train frontier small models using the LFM2.5 recipe: on-policy preference alignment, agentic reinforcement learning, and curriculum training with iterative model merging. We'll cover training challenges unique to the 1B scale, like doom loops, capability interference, and how to fix them. The goal is to give you a concrete playbook to fine-tune and deploy small models for your own use cases, from structured data extraction to multi-turn tool use. Speaker info: https://x.com/maximelabonne / maxime-labonne https://github.com/mlabonne Timestamps: 0:00:00 - Start 0:00:14 - Introduction to frontier small models at Liquid AI 0:01:02 - Characteristics: memory-bound, task-specific, latency-sensitive 0:02:20 - Architecture: why large embedding layers are inefficient 0:04:01 - LFM2 architecture: using gated short convolutions for speed 0:06:09 - LFM 2.5 recipe: 28T tokens and post-training stages 0:08:34 - Post-training: SFT, preference alignment, and RL best practices 0:10:43 - Identifying "doom loops" in reasoning models 0:11:34 - Solutions: mitigating loops via preference alignment and RL 0:15:29 - Future focus: using agentic tools to overcome memory limits 0:17:58 - Q&A: real-world applications for small vs. large models

MIT 6.S191 (2025): Large Language Models (Liquid AI)

MIT 6.S191 (2025): Large Language Models (Liquid AI)

Google DeepMind Distinguished Eng (L9): How To Land a Job at a Frontier Lab | Vlad Feinberg

Google DeepMind Distinguished Eng (L9): How To Land a Job at a Frontier Lab | Vlad Feinberg

0 Data Science Understanding: 금융시장의 수학과 공학 | Tutorial Probability Statistics Algorithm DataScience

0 Data Science Understanding: 금융시장의 수학과 공학 | Tutorial Probability Statistics Algorithm DataScience

RAG Crash Course for Beginners

RAG Crash Course for Beginners

Text Diffusion — Brendan O’Donoghue, Google DeepMind

Text Diffusion — Brendan O’Donoghue, Google DeepMind

The future of Security Operations Centers (keynote by security expert Costin Vilcu)

The future of Security Operations Centers (keynote by security expert Costin Vilcu)

Hermes Architecture EXPLAINED: Memory, Context & Gateways

Hermes Architecture EXPLAINED: Memory, Context & Gateways

Linus Torvalds: AI Can’t Think Like a Programmer

Linus Torvalds: AI Can’t Think Like a Programmer

CHOSEN ONE!! YOUR IDENTITY REVEAL JUST SHOOK THE INTERNET... AND THEIR MINDS

CHOSEN ONE!! YOUR IDENTITY REVEAL JUST SHOOK THE INTERNET... AND THEIR MINDS

Ian Proud: Anti-russische Sanktionen wirken nicht – Wie der Ukraine-Krieg endet

Ian Proud: Anti-russische Sanktionen wirken nicht – Wie der Ukraine-Krieg endet

Inference, Diffusion, World Models, and More | YC Paper Club

Inference, Diffusion, World Models, and More | YC Paper Club

Hermes Agent Fundamentals In 29 Minutes

Hermes Agent Fundamentals In 29 Minutes

LLMs Don't Need More Parameters. They Need Loops.

LLMs Don't Need More Parameters. They Need Loops.

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

LLM vs. SLM vs. FM: Choosing the Right AI Model

LLM vs. SLM vs. FM: Choosing the Right AI Model

Anthropic's Boris Cherny: Why Coding Is Solved, and What Comes Next

Anthropic's Boris Cherny: Why Coding Is Solved, and What Comes Next

Why Netflix is betting on systems thinkers—not specialists—in the AI era | Elizabeth Stone (CPTO)

Why Netflix is betting on systems thinkers—not specialists—in the AI era | Elizabeth Stone (CPTO)

Is RAG Still Needed? Choosing the Best Approach for LLMs

Is RAG Still Needed? Choosing the Best Approach for LLMs

The Production AI Playbook: Deploying Agents at Enterprise Scale — Sandipan Bhaumik, Databricks

The Production AI Playbook: Deploying Agents at Enterprise Scale — Sandipan Bhaumik, Databricks

Anthropic Workshop: Build Agents That Run for Hours — Ash Prabaker & Andrew Wilson

Anthropic Workshop: Build Agents That Run for Hours — Ash Prabaker & Andrew Wilson