Interview with NVIDIA Dynamo Architect Kyle Kranen

In this episode, Nader and Carter interview NVIDIA Dynamo architect Kyle Kranen to learn about what Dynamo is and how it can make models like DeepSeek-R1 increase throughput by up to 30x! You have 3 levers when running inference on AI models: quality, cost, speed. For example: reasoning models like DeepSeek-R1 do test-time scaling, where asking the model to think improves quality but reduces speed and increases costs. We dive into how NVIDIA Dynamo gives you the ability to tweak all 3 levers through techniques like disaggregation, kv offloading, and kv routing. Read: https://developer.nvidia.com/blog/int... Follow Kyle ➡️   / kyle-kranen   Follow Carter ➡️   / carter-abdallah-958666140   Follow Nader ➡️   / naderlikeladder  

Beyond the Algorithm with NVIDIA:  Introducing NVIDIA Dynamo
▶︎

Beyond the Algorithm with NVIDIA: Introducing NVIDIA Dynamo

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit
▶︎

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

By the [run]Book: Episode 23
▶︎

By the [run]Book: Episode 23

NVIDIA Dynamo: High performance Open Source Interface | William Arnold | AER Labs
▶︎

NVIDIA Dynamo: High performance Open Source Interface | William Arnold | AER Labs

NVIDIA CEO Jensen Huang's Vision for the Future
▶︎

NVIDIA CEO Jensen Huang's Vision for the Future

Getting Started with CUDA and Parallel Programming | NVIDIA GTC 2025 Session
▶︎

Getting Started with CUDA and Parallel Programming | NVIDIA GTC 2025 Session

Robotics' End Game: Nvidia's Jim Fan
▶︎

Robotics' End Game: Nvidia's Jim Fan

Leading in the Age of AI: A Conversation with NVIDIA CEO Jensen Huang | Global Conference 2026
▶︎

Leading in the Age of AI: A Conversation with NVIDIA CEO Jensen Huang | Global Conference 2026

AI Perf benchmarking - Dynamo and other LLM endpoints
▶︎

AI Perf benchmarking - Dynamo and other LLM endpoints

START YOUR TUESDAY WITH FAITH | TODAY GOD IS GIVING YOU UNEXPECTED OPPORTUNITIES | FATHER FREDDY ...
▶︎

START YOUR TUESDAY WITH FAITH | TODAY GOD IS GIVING YOU UNEXPECTED OPPORTUNITIES | FATHER FREDDY ...

NVIDIA Triton Inference Server and its use in Netflix's Model Scoring Service
▶︎

NVIDIA Triton Inference Server and its use in Netflix's Model Scoring Service

Jensen Huang of Nvidia on the Future of A.I. | DealBook Summit 2023
▶︎

Jensen Huang of Nvidia on the Future of A.I. | DealBook Summit 2023

Announcing NVIDIA RTX Spark | GTC Taipei 2026 Keynote by CEO Jensen Huang
▶︎

Announcing NVIDIA RTX Spark | GTC Taipei 2026 Keynote by CEO Jensen Huang

Insights from NVIDIA Research | NVIDIA GTC
▶︎

Insights from NVIDIA Research | NVIDIA GTC

Stanford CS153 Frontier Systems | Jensen Huang from NVIDIA on the Compute Behind Intelligence
▶︎

Stanford CS153 Frontier Systems | Jensen Huang from NVIDIA on the Compute Behind Intelligence

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan
▶︎

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral
▶︎

Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral

NVIDIA Dynamo Developer Office Hours
▶︎

NVIDIA Dynamo Developer Office Hours

Jensen Huang: Nvidia's Future, Physical AI, Rise of the Agent, Inference Explosion, AI PR Crisis
▶︎

Jensen Huang: Nvidia's Future, Physical AI, Rise of the Agent, Inference Explosion, AI PR Crisis

JUST RECORDED: Elon Musk Announces SPACEX Plans
▶︎

JUST RECORDED: Elon Musk Announces SPACEX Plans