GPUs in Kubernetes for AI Workloads
Today we dive into running AI models on Kubernetes with GPU support. Learn how to manage GPUs in Kubernetes clusters, create GPU nodes, and optimize resource usage without breaking the bank. We'll walk you through setting up a Google Cloud Kubernetes cluster (the same logic should apply to other Cloud providers), deploying AI models like Ollama's Llama2, and handling GPU partitioning. Watch now to master GPU-based AI workloads in Kubernetes! #Kubernetes #GPU #AI ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬ Sponsor: CAST AI 🔗 https://cast.ai/devopstoolkit ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬ Consider joining the channel: / devopstoolkit ▬▬▬▬▬▬ 🔗 Additional Info 🔗 ▬▬▬▬▬▬ ➡ Transcript and commands: https://devopstoolkit.live/ai/unlock-... ▬▬▬▬▬▬ 💰 Sponsorships 💰 ▬▬▬▬▬▬ If you are interested in sponsoring this channel, please visit https://devopstoolkit.live/sponsor for more information. Alternatively, feel free to contact me over Twitter or LinkedIn (see below). ▬▬▬▬▬▬ 👋 Contact me 👋 ▬▬▬▬▬▬ ➡ Twitter: / vfarcic ➡ LinkedIn: / viktorfarcic ▬▬▬▬▬▬ 🚀 Other Channels 🚀 ▬▬▬▬▬▬ 🎤 Podcast: https://www.devopsparadox.com/ 💬 Live streams: / devopsparadox ▬▬▬▬▬▬ ⏱ Timecodes ⏱ ▬▬▬▬▬▬ 00:00 AI Inference with GPUs 01:30 CAST AI (sponsor) 02:29 Using GPUs for AI Inference in Kubernetes

Building a GPU cluster for AI

vLLM on Kubernetes in Production

Stop Using Docker Wrong — Podman & Rancher in 2026
![Kubernetes Crash Course for Absolute Beginners [NEW]](https://i.ytimg.com/vi/s_o8dwzRlu4/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLAfg4KRReNtQkLAjORAuzDyyoaBFg)
Kubernetes Crash Course for Absolute Beginners [NEW]

Understanding GPU Resources in Kubernetes

How to Design a GPU Cluster for AI Training - The Deep Learning System Design Interview

Mastering GPU Management in Kubernetes Using the Operator Pattern- Shiva Krishna Merla & Kevin Klues

MCP vs CLI: I Tested Both and Here's What Won

Kubernetes Zero to Hero: The Complete Beginner’s Guide (2025 Edition)

Explain How Kubernetes Works With GPU Like I’m 5 - Carlos Santana, AWS

Kubernetes Controllers Deep Dive: How They Really Work

AI in Kubernetes: How to Get Started?

Stop Losing Requests! Learn Graceful Shutdown Techniques

The End of Infrastructure-as-Code: AI Changes Everything... Maybe...

Mastering Kubernetes: Service and Network APIs (Service, Ingress, GatewayAPI)

Kubernetes for AI: Pass-Through GPU in a Linux Machine

Mastering Kubernetes: Workloads APIs (Deployment, StatefulSet, ReplicaSet, Pod, etc.)

Unleashing WebAssembly in Kubernetes with Kwasm

Keynote: Accelerating AI Workloads with GPUs in Kubernetes - Kevin Klues & Sanjay Chatterjee

