
▶︎
SIGCSE TS 2026 - Saturday Keynote: "CS and SE Education, post-AI" by Titus Winters (Adobe)

▶︎
MultiGPU + NCCL from the authors

▶︎
tpu

▶︎
Lecture 36: CUTLASS and Flash Attention 3

▶︎
Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu

▶︎
I Benchmarked vLLM vs SGLang So You Don't Have To Shocking Results!

▶︎
What is CUDA? - Computerphile

▶︎
Lecture 16: On Hands Profiling

▶︎
Learn RDMA Programming: NVIDIA’s Guide to High-Performance Networking

▶︎
Lecture 67: NCCL and NVSHMEM

▶︎
Scaling RoCE Networks for AI Training | Adi Gangidi

▶︎
Lecture 8: CUDA Performance Checklist

▶︎
Understanding the LLM Inference Workload - Mark Moyou, NVIDIA

▶︎
A friendly introduction to distributed training (ML Tech Talks)

▶︎
Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

▶︎
Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains

▶︎
Lecture 14: Practitioners Guide to Triton

▶︎
Distributed Training with PyTorch: complete tutorial with cloud infrastructure and code

▶︎
Lecture 23: Tensor Cores

▶︎
