Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 10: Inference

For more information about Stanford's online Artificial Intelligence programs visit: https://stanford.io/ai To learn more about enrolling in this course visit: https://online.stanford.edu/courses/c... To follow along with the course schedule and syllabus visit: https://stanford-cs336.github.io/spri... Percy Liang Associate Professor of Computer Science Director of Center for Research on Foundation Models (CRFM) Tatsunori Hashimoto Assistant Professor of Computer Science View the entire course playlist:    • Stanford CS336 Language Modeling from Scra...  

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 11: Scaling laws 2
▶︎

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 11: Scaling laws 2

Why Inference is hard..
▶︎

Why Inference is hard..

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1
▶︎

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 9: Scaling laws 1
▶︎

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 9: Scaling laws 1

Understanding the LLM Inference Workload - Mark Moyou, NVIDIA
▶︎

Understanding the LLM Inference Workload - Mark Moyou, NVIDIA

Causal Mechanistic Interpretability (Stanford lecture 1) - Atticus Geiger
▶︎

Causal Mechanistic Interpretability (Stanford lecture 1) - Atticus Geiger

Terence Tao: Nobody Understands Why AI Actually Works
▶︎

Terence Tao: Nobody Understands Why AI Actually Works

"First Proof: Mathematicians Putting AI to the Test" March 14, 2026
▶︎

"First Proof: Mathematicians Putting AI to the Test" March 14, 2026

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 1: Overview and Tokenization
▶︎

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 1: Overview and Tokenization

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 3: Architectures
▶︎

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 3: Architectures

Faster LLMs: Accelerate Inference with Speculative Decoding
▶︎

Faster LLMs: Accelerate Inference with Speculative Decoding

"A.I. and Our Economic Future," Professor Chad Jones
▶︎

"A.I. and Our Economic Future," Professor Chad Jones

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 1: Overview, Tokenization
▶︎

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 1: Overview, Tokenization

Creator of C++: Bell Labs, Negative Overhead Abstraction, Mistakes | Bjarne Stroustrup
▶︎

Creator of C++: Bell Labs, Negative Overhead Abstraction, Mistakes | Bjarne Stroustrup

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 15: Mid/Post-Training
▶︎

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 15: Mid/Post-Training

Trump Gets Booed & Falls Asleep During NBA Finals, Claims War is Almost Over & Goodbye Spencer Pratt
▶︎

Trump Gets Booed & Falls Asleep During NBA Finals, Claims War is Almost Over & Goodbye Spencer Pratt

Stanford CS336 Lang. Modeling from Scratch | Spring 2025 | Lec. 3: Architectures, Hyperparameters
▶︎

Stanford CS336 Lang. Modeling from Scratch | Spring 2025 | Lec. 3: Architectures, Hyperparameters

Something is jamming GPS over Europe. Here's what we found
▶︎

Something is jamming GPS over Europe. Here's what we found