From Pull To Predict: Accelerating AI Model Deployment on Kubernetes - Lucas Duarte & Tiago Reichert

Don't miss out! Join us at our next Flagship Conference: KubeCon + CloudNativeCon events in Amsterdam, The Netherlands (23-26 March, 2026). Connect with our current graduated, incubating, and sandbox projects as the community gathers to further the education and advancement of cloud native computing. Learn more at https://kubecon.io From Pull To Predict: Accelerating AI Model Deployment on Kubernetes - Lucas Duarte & Tiago Reichert, AWS In the era of large AI models, deployment latency and resource utilization present significant challenges for Kubernetes operators. This session demonstrates techniques to reduce model startup times and optimize cluster resources. We'll deploy a 7B parameter LLM using Ray and vLLM for scaling and serving, implementing three key optimizations: SOCI (Seekable OCI) for lazy loading of container images, enabling containers to start without downloading the entire image first; an optimized storage layer that keeps models pre-downloaded and ready for quick access; and intelligent node provisioning using Karpenter for dynamic resource allocation. We'll compare a standard deployment against one using these optimizations, showing the differences in startup times, resource usage, and operational costs. Attendees will learn implementation steps for these techniques, which they can apply to their own Kubernetes environments to improve AI model deployment efficiency.

Scaling and Securing CoreDNS: Performance and Resilience - Yong Tang & John Belamaric
▶︎

Scaling and Securing CoreDNS: Performance and Resilience - Yong Tang & John Belamaric

Kubernetes Zero to Hero: The Complete Beginner’s Guide (2025 Edition)
▶︎

Kubernetes Zero to Hero: The Complete Beginner’s Guide (2025 Edition)

Rachid Zarouali—How I Build a Cloud-Agnostic, Kubernetes-as-a-Service Platform 100% OSS in a Month
▶︎

Rachid Zarouali—How I Build a Cloud-Agnostic, Kubernetes-as-a-Service Platform 100% OSS in a Month

AI Agents on Kubernetes: KAgent
▶︎

AI Agents on Kubernetes: KAgent

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026
▶︎

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

AI Agents on Kubernetes: Introduction
▶︎

AI Agents on Kubernetes: Introduction

The FDE Playbook for AI Startups with Bob McGrew
▶︎

The FDE Playbook for AI Startups with Bob McGrew

How AI agents & Claude skills work (Clearly Explained)
▶︎

How AI agents & Claude skills work (Clearly Explained)

Who Are You? Configuring Kafka Authentication in Strimzi from TLS to Custom Principals | D. Mulder
▶︎

Who Are You? Configuring Kafka Authentication in Strimzi from TLS to Custom Principals | D. Mulder

200 DIOS TE DICE HOY: ESCUCHA ESTO ANTES DE DORMIR, MI VOZ TE DARÁ PAZ Y DESCANSO
▶︎

200 DIOS TE DICE HOY: ESCUCHA ESTO ANTES DE DORMIR, MI VOZ TE DARÁ PAZ Y DESCANSO

Cilium, Hubble & Tetragon for AWS EKS
▶︎

Cilium, Hubble & Tetragon for AWS EKS

Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]
▶︎

Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan
▶︎

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

I Spent 20 Days Building the Cheapest Forest House Alone to Live: Solo Bushcraft (Full)
▶︎

I Spent 20 Days Building the Cheapest Forest House Alone to Live: Solo Bushcraft (Full)

If you need calm, you'll feel this on your skin (comfort for restless minds)
▶︎

If you need calm, you'll feel this on your skin (comfort for restless minds)

الرقية الشرعية للشفاءمن السحروالعين والحسد حصن من الشيطان رقية البيت والاولاد بصوت القارئ سعيد حمدان
▶︎

الرقية الشرعية للشفاءمن السحروالعين والحسد حصن من الشيطان رقية البيت والاولاد بصوت القارئ سعيد حمدان

Complete Terraform Course - From BEGINNER to PRO! (Learn Infrastructure as Code)
▶︎

Complete Terraform Course - From BEGINNER to PRO! (Learn Infrastructure as Code)

Swapping the Engine Mid-Flight: How We Moved Reddit’s Petabyte Scale Kafka Fleet to ... | S. Kistler
▶︎

Swapping the Engine Mid-Flight: How We Moved Reddit’s Petabyte Scale Kafka Fleet to ... | S. Kistler

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview
▶︎

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

Andrew Ng: Building Faster with AI
▶︎

Andrew Ng: Building Faster with AI