Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu

Zhiyi Hu, Siyuan Shen, Tommaso Bonato (ETH Zurich), Sylvain Jeaugey (NVIDIA), Cedell Alexander, Eric Spada (Broadcom), James Dinan, Jeff Hammond (NVIDIA) and Torsten Hoefler (ETH Zurich)

Building Custom AI Infrastructure with NVLink Fusion - Krishnan Geeyarpuram
▶︎

Building Custom AI Infrastructure with NVLink Fusion - Krishnan Geeyarpuram

Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025
▶︎

Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025

Broad overview of privacy/security issues in the ML space
▶︎

Broad overview of privacy/security issues in the ML space

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA
▶︎

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

Ultra Ethernet for next-generation AI and HPC workloads
▶︎

Ultra Ethernet for next-generation AI and HPC workloads

What Nobody Tells You About Being a Quant
▶︎

What Nobody Tells You About Being a Quant

Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou
▶︎

Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou

Getting Started with CUDA and Parallel Programming | NVIDIA GTC 2025 Session
▶︎

Getting Started with CUDA and Parallel Programming | NVIDIA GTC 2025 Session

Explain How Kubernetes Works With GPU Like I’m 5 - Carlos Santana, AWS
▶︎

Explain How Kubernetes Works With GPU Like I’m 5 - Carlos Santana, AWS

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview
▶︎

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

How is hardware reshaping LLM design?
▶︎

How is hardware reshaping LLM design?

Creator of C++: Bell Labs, Negative Overhead Abstraction, Mistakes | Bjarne Stroustrup
▶︎

Creator of C++: Bell Labs, Negative Overhead Abstraction, Mistakes | Bjarne Stroustrup

How Nvidia GPUs Compare To Google’s And Amazon’s AI Chips
▶︎

How Nvidia GPUs Compare To Google’s And Amazon’s AI Chips

Accelerating Frontier MoE Training with 3D Integrated Optics - Taylor Groves
▶︎

Accelerating Frontier MoE Training with 3D Integrated Optics - Taylor Groves

Tutorial: GPU Communication Libraries for Accelerating HPC and AI Applications
▶︎

Tutorial: GPU Communication Libraries for Accelerating HPC and AI Applications

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026
▶︎

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

The Genius of Computing with Light
▶︎

The Genius of Computing with Light

NVIDIA CEO Jensen Huang's Vision for the Future
▶︎

NVIDIA CEO Jensen Huang's Vision for the Future

Lecture 67: NCCL and NVSHMEM
▶︎

Lecture 67: NCCL and NVSHMEM

Stanford CS153 Frontier Systems | Jensen Huang from NVIDIA on the Compute Behind Intelligence
▶︎

Stanford CS153 Frontier Systems | Jensen Huang from NVIDIA on the Compute Behind Intelligence