Anomaly Detection for Data Quality and Metric Shifts at Netflix | Netflix

Recorded at DataEngConf SF17 in April, 2017 In the course of transforming, publishing and visualizing data, there’s risk of “bad data” creeping into your output at every turn, hurting data credibility and distracting teams from investigating real metric shifts. How does Netflix prevent bad data from causing bad decision-making? We use a variety of techniques to automate the basics, allowing us to focus our energy on the changes in data that indicate real problems with the Netflix product. Hear examples of 1) the checks we impose at multiple steps of the data pipeline to identify source data quality issues and business metric shifts, 2) techniques for anomaly detection on datasets with many dimensions that are highly cardinal, 3) how to set up evaluations in an automated fashion and 4) how we make it easy for humans to investigate issues. ABOUT DATA COUNCIL: Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers. Make sure to subscribe to our channel for more videos, including DC_THURS, our series of live online interviews with leading data professionals from top open source projects and startups. FOLLOW DATA COUNCIL: Twitter:   / datacouncilai   LinkedIn:   / datacouncil-ai   Facebook:   / datacouncilai   Eventbrite: https://www.eventbrite.com/o/data-cou... - 🎟️ GET YOUR TICKET TO AI COUNCIL 2026 🎟️ Meet the world's top AI infrastructure minds where architects of AI share what works. Three days of high-quality technical talks and meaningful interactions. → https://aicouncil.com/sf-2026 ⚡ FIND US: X: https://x.com/AICouncilConf LinkedIn:   / aicouncilconf   Website: https://aicouncil.com/

Hoodie: An Open Source Incremental Processing Framework From Uber | Uber
▶︎

Hoodie: An Open Source Incremental Processing Framework From Uber | Uber

Anomaly Detection for Payment Processing at Netflix, Shankar Vedaraman & Chris Colburn 20150126
▶︎

Anomaly Detection for Payment Processing at Netflix, Shankar Vedaraman & Chris Colburn 20150126

Machine Learning Applications for Energy Efficiency and Customer Care
▶︎

Machine Learning Applications for Energy Efficiency and Customer Care

Anomaly Detection: Algorithms, Explanations, Applications
▶︎

Anomaly Detection: Algorithms, Explanations, Applications

Data Quality KPIs to C-Level in Fintech: 8 Governance Principles
▶︎

Data Quality KPIs to C-Level in Fintech: 8 Governance Principles

Mastering Chaos - A Netflix Guide to Microservices
▶︎

Mastering Chaos - A Netflix Guide to Microservices

Data Mesh in Practice - Assuring Data Quality at Scale - Gayathri Thiyagarajan - DDD Europe 2022
▶︎

Data Mesh in Practice - Assuring Data Quality at Scale - Gayathri Thiyagarajan - DDD Europe 2022

Anomaly Detection 101 - Elizabeth (Betsy) Nichols Ph.D.
▶︎

Anomaly Detection 101 - Elizabeth (Betsy) Nichols Ph.D.

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Data Reliability Engineering: A New Approach to Data Quality | Bigeye
▶︎

Data Reliability Engineering: A New Approach to Data Quality | Bigeye

Unit8 Talks #7 - Fraud detection - A guide to building a financial transaction anomaly detector
▶︎

Unit8 Talks #7 - Fraud detection - A guide to building a financial transaction anomaly detector

Everything You Need to Know About Big Data: From Architectural Principles to Best Practices
▶︎

Everything You Need to Know About Big Data: From Architectural Principles to Best Practices

Mastering Enterprise Data Quality Monitoring & Management | TestingXperts x Datagaps
▶︎

Mastering Enterprise Data Quality Monitoring & Management | TestingXperts x Datagaps

Future of Data Engineering
▶︎

Future of Data Engineering

Robust anomaly detection for real user monitoring data - Velocity 2016, Santa Clara, CA
▶︎

Robust anomaly detection for real user monitoring data - Velocity 2016, Santa Clara, CA

How Netflix Handles Data Streams Up to 8M Events/sec
▶︎

How Netflix Handles Data Streams Up to 8M Events/sec

Watch Everything, Watch Anything: Anomaly Detection By Nathaniel Cook
▶︎

Watch Everything, Watch Anything: Anomaly Detection By Nathaniel Cook

Anomaly Detection - Nick Radcliffe
▶︎

Anomaly Detection - Nick Radcliffe

Data Pipeline Frameworks: The Dream and the Reality |  Beeswax
▶︎

Data Pipeline Frameworks: The Dream and the Reality | Beeswax

Deep Dive into LLMs like ChatGPT
▶︎

Deep Dive into LLMs like ChatGPT