How to integrate Great Expecation Data Quality tests in Airflow? | Data pipeline | Data Quality
In this video, we will cover how to integrate Great Expectation Data Quality tests in Apache Airflow. In this session, we will use the Great Expectation (GE) provider for Airlow and run the Great Expectations suite. Our data asset will be a PostgreSQL table. In this tutorial, we will see how to test an ETL Pipeline with Great Expecations using Python. It is essential to test the quality of data before it lands in our production systems. We will focus on Product dimension and employ various built-in GE Data Quality tests. Links to related sessions. Link to GitHub (Updated DAG): https://github.com/hnawaz007/pythonda... Airflow Installation & Configuration with custom image: • Airflow Installation & Configurations | Em... In the custom image we add the following line to install GE provider: && pip install airflow-provider-great-expectations Orchestrate SQL Data Pipelines with Airflow: • Orchestrate SQL Data Pipelines with Airflo... How to test your Data Pipelines with Great Expectations: • How to test your Data Pipelines with Great... How to create Great Epxectations suite? • How to create Great Epxectations suite? Qu... Link to GE Expectations notebook: https://github.com/hnawaz007/pythonda... Link to GE suite used in the vidoe: https://github.com/hnawaz007/pythonda... Link to Channel's site: https://hnawaz007.github.io/ -------------------------------------------------------------- 💥Subscribe to our channel: / haqnawaz 📌 Links ----------------------------------------- #️⃣ Follow me on social media! #️⃣ 🔗 GitHub: https://github.com/hnawaz007 📸 Instagram: / bi_insights_inc 📝 LinkedIn: / haq-nawaz 🔗 / hnawaz100 🚀 https://hnawaz007.github.io/ ----------------------------------------- #ETL #dataquality #Airflow Topics in this video (click to jump around): ================================== 0:00 - Introduction to Great Expectations Data Quality 0:49 - Prerequisites 1:16 - Create Great Expectation suite 1:46 - Review Great Expectation Data Quality Tests 2:29 - Airflow DAG 2:49 - Integrate Great Expectations Data Quality in Airflow 3:34 - Airflow UI: Dag review & run 3:56 - DAG logs: review Data Quality test run

Data Lakehouse workflow Apache Iceberg and Nessie | How Iceberg works | Nessie Branch & Merge

Always know what to expect from your data with great_expectations

Delta Lake - EXPLAINED - Full Tutorial

Grafana User Guide

Build an end to end data lake etl pipeline | Airflow | Iceberg | dbt | Trino | Postgres

How to Build a Data Quality Monitoring System with Great Expectations & Airflow

Build an End-to-End ETL Pipeline with Python & PostgreSQL

Airflow Vs. Dagster: The Full Breakdown!

Data Lineage with OpenLineage and Airflow

Advanced Data Quality Use Cases with Airflow and Great Expectations

Great Expectation Tutorial | Mastering Data Quality

ETL vs ELT: Powering Data Pipelines for AI & Analytics
![Top 10 Data Quality Questions Asked In Data Engineering Interviews [2025 Guide] #dataquality](https://i.ytimg.com/vi/wjMY7ayPaJ4/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDSyzgCjNqIKUI0G3rbXbfQ2knghg)
Top 10 Data Quality Questions Asked In Data Engineering Interviews [2025 Guide] #dataquality

Building a robust data pipeline with the dAG stack dbt, Airflow, Great Expectations

Don't Use Apache Airflow

Implementing Data Quality in Python w/ Great Expectations

Airflow DAG: Make your data pipelines better!

CI/CD for Databricks: Advanced Asset Bundles and GitHub Actions

What's new in Apache Airflow 3?

