Understanding NVIDIA GPU Hardware as a CUDA C Programmer | Episode 2: GPU Compute Architecture
NVIDIA GPU hardware from the CUDA C programmer's point of view. Video Notes: https://0mean1sigma.com/chapter-3-gpu... Code Repository: https://github.com/tgautam03/CUDA-C Animations: https://github.com/tgautam03/0Mean1Si... 00:00 - Introduction 00:50 - GPU Hardware 02:50 - Warps 04:55 - Latency Tolerance 07:01 - Conclusion

▶︎
Modern GPU Architecture | GPU Programming

▶︎
How LLMs use multiple GPUs

▶︎
DAY 30 - Merge Two Sorted Linked Lists

▶︎
4.5x Faster CUDA C with just Two Variable Changes || Episode 3: Memory Coalescing

▶︎
Getting Started with CUDA and Parallel Programming | NVIDIA GTC 2025 Session

▶︎
GPU Pipeline Optimization Explained | Async UDFs, CUDA Streams & Pinned Memory

▶︎
Give Me 30 min, I'll Make CUDA Click Forever

▶︎
2678x Faster with CUDA C: Simple Matrix Multiplication on a GPU | Episode 1: Introduction to GPGPU

▶︎
Stanford CS149 I Parallel Computing I 2023 I Lecture 7 - GPU architecture and CUDA Programming

▶︎
What is CUDA? - Computerphile

▶︎
How to write a fast Softmax kernel

▶︎
Memory Hierarchy | GPU Programming | Episode 6

▶︎
NVIDIA Monopoly is DEAD | OPEN-SOURCE Chips Are HERE!

▶︎
Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains

▶︎
Every Level of Reverse Engineering Explained

▶︎
Introduction | GPU Programming | Episode 0

▶︎
GPUs: Explained

▶︎
GPU Warps Explained: How SIMT Really Works Under the Hood (Visual Deep Dive) | M2L3

▶︎
How do Graphics Cards Work? Exploring GPU Architecture

▶︎
