Datadog on Kubernetes Monitoring
With many blog posts published and talks given on the topic, it’s no secret that Datadog is running Kubernetes at scale. We currently run dozens of clusters, some of them with thousands of nodes. Additionally, we have clusters running in multiple clouds. How are we monitoring all of that, ensuring we can scale up quickly and safely? In this session Ara Pulido, Technical Evangelist, will chat with Celene Chang and Charly Fontaine - both software engineers on the Container Integrations team at Datadog. This team is responsible for deploying and running the Datadog Agent in our Kubernetes clusters. We’ll cover how we are running the Datadog Agent in our clusters, which metrics we care about, and the monitors we have set up. By the end of the session you will have new ideas and best practices on monitoring Kubernetes with Datadog that you can apply in your own environment. Links mentioned in the talk ExtendedDaemonset Github: https://github.com/DataDog/extendedda... Watermark Pod Autoscaler Github: https://github.com/DataDog/watermarkp... How to monitor Kubernetes audit logs: https://www.datadoghq.com/blog/monito... Explore Kubernetes resources with Datadog Live Containers: https://www.datadoghq.com/blog/explor... 00:00 - Intro 02:01 - Main discussion 06:19 - Kubernetes Monitoring 101 14:36 - Best practices: Agent deployment 18:28 - Best practices: Platform monitoring 29:23 - Best practices: Audit logs 31:08 - Best practices: Workload monitoring 34:36 - Best practices: Tagging 38:45 - Autoscaling 48:48 - KubeCon discussion 53:29 - Q&A

Datadog on Software Delivery

Kubernetes Zero to Hero: The Complete Beginner’s Guide (2025 Edition)

Datadog on Security and Compliance

Kubernetes and retiring at the top with Kelsey Hightower

Kubernetes Monitoring Made Easy with Prometheus | KodeKloud

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

Complete Kubernetes Course - From BEGINNER to PRO

DASH by Datadog 2025 Keynote

Datadog on Site Reliability Engineering

Datadog on Kubernetes Node Management

Ensuring Reliability with SLOs with Datadog & Google Cloud
![Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]](https://i.ytimg.com/vi/X48VuDVv0do/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDNg7nINwKqigXGqrL80FN9YuTNGg)
Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]

Datadog on Kubernetes

Jfrog | Jfrog Artifactory | Jfrog Artifactory Tutorial | Artifactory Tutorial | Intellipaat

Datadog on Kafka

Modern Architecture 101 for New Engineers & Forgetful Experts - Jerry Nixon - NDC Copenhagen 2025
![Kubernetes Crash Course for Absolute Beginners [NEW]](https://i.ytimg.com/vi/s_o8dwzRlu4/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLAfg4KRReNtQkLAjORAuzDyyoaBFg)
Kubernetes Crash Course for Absolute Beginners [NEW]

Azure Kubernetes Service (AKS) Networking Deep Dive

Observability and Security for the AI Era

