Open-Source Spotlight - Great Expectations (Data Quality Platform) - James Campbell
Great Expectations Cofounder James Campbell presents a demo of the open-source data quality platform Great Expectations. 00:00 Introducing James Campbell and Great Expectations 02:13 Demo: Notebook setup 03:00 NYC taxi data example - exploratory analysis flow 04:00 Use out-of-band knowledge to build Expectations 05:15 Use the Mostly parameter to turn row-level Expectations to batch-level 05:30 Build Data Docs from Expectations and Validation Results 06:50 Use a Data Assistant to Suggest Expectations based on previous batches 09:20 Run a Checkpoint to validate new batches of data 12:30 Add validation actions in Checkpoint configurations 13:25 Demo: Onboarding Data Assistant creates a more comprehensive picture of the dataset 16:04 Workflow for using Great Expectations in a pipeline 17:03 Great Expectations accesses data from any source during Expectation Validation 17:50 Expectations on Pandas or SQL backends 19:20 Great Expectations Cloud to support configuration management and users 20:27 The team and contributors behind Great Expectations 21:00 Join the Slack community 22:46 Advice: Work on the things you love Links: Great Expectations website: https://greatexpectations.io MLOps Zoomcamp: https://github.com/DataTalksClub/mlop... Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Always know what to expect from your data with great_expectations

Great Expectation Tutorial | Mastering Data Quality

Data Quality as part of the Data Pipeline

Why Aliens Would NEVER Invade Africa

Why use DuckDB in your data pipelines ft. Niels Claeys

How to Build a Data Quality Monitoring System with Great Expectations & Airflow

AWS re:Invent 2022 - Simplify & accelerate data integration & ETL modernization w/AWS Glue (ANT223)

Lakehouse data validation with Great Expectations in Microsoft Fabric

Building Real-time Applications with ClickHouse Materialized Views

Learn Practical Techniques for Applying Data Quality in the Lakehouse with Databricks

What is Databricks? The Story Behind the Modern Data Platform (Visual Explanation)

How To Think SO CLEARLY People Assume You're A Genius

Elevating Data Quality: Great Expectations and Airflow at PepsiCo

How to Create a Shared, Open Standard for Data Quality | Great Expectations

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Great Expectations (GX) for DATA Testing - Introduction

Advanced Data Quality Use Cases with Airflow and Great Expectations

Data Contracts - Accountable Data Quality | Data Quality Camp

Great Expectations and Databricks notebooks: a powerful alliance for validated data workflows

