Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 5 - Architectures

Learn more details about this course: https://online.stanford.edu/courses/c...
To follow along with the course schedule and syllabus, visit: https://cme296.stanford.edu/syllabus/

Chapters:
00:00:00 Introduction
00:05:26 Objective
00:09:58 Convolutions, filters
00:14:44 Receptive field
00:17:14 Pooling
00:19:06 U-Net
00:27:52 Timestep representation
00:30:31 Class label representation
00:33:21 Timeline of U-Net models
00:35:43 Diffusion Transformer (DiT)
00:48:08 Adaptive layer normalization (adaLN)
01:02:30 DiT end-to-end example
01:12:57 Multimodal DiT (MM-DiT)
01:23:33 Qwen-Image, Z-Image, FLUX.1
01:24:27 Timeline of DiT models
01:25:25 Absolute position embeddings
01:38:48 Rotary position embeddings (RoPE)
01:39:59 2D RoPE variants

For more information about Stanford’s graduate programs, visit: https://online.stanford.edu/graduate-...

Afshine Amidi is an Adjunct Lecturer at Stanford University.
Shervine Amidi is an Adjunct Lecturer at Stanford University.

View the course playlist: Stanford CME296: Diffusion & Large Vision ...
