Build an end to end data lake etl pipeline | Airflow | Iceberg | dbt | Trino | Postgres
In this video we are covering the end to end on-premise data lake/lakehouse setup! We will simplify the makeup of the Datalake. It will be SQL based. We utilize Apache Airflow, Iceberg, dbt, MinIO, Postgres and Trino. We removed the JVM based metastore from the equation in the Python based setup and will continue on that trend. Be sure to check out the related links to get familiar with the tech stack. 🔗 Tools setup guide: Airflow overview setup link:    • Airflow Installation & Configurations | Em...  dbt series link:    • Data Build Tool (dbt)  MinIO setup link:    • How to build on-premise Data Lake? | Build...  Iceberg setup link:    • Data Lakehouse workflow Apache Iceberg and...  Postgres setup link:    • How to install PostgreSQL  on windows | cr...  Install Python:    • Install Jupyter Notebook on Windows | Pyth...  Link to GitHub repo: https://github.com/hnawaz007/datalake... 💡 Why This Matters: No more JVM setup or Hive metastore requirements! With this modern stack, setting up a Datalake becomes faster, leaner, and more efficient. This will give you and end to end overview of the Datalake setup and how to perform data engineering task in the Datalake setup. 👉 Start small with your flat file source and let MinIO + Iceberg + Postgres handle the rest. #DataLake #Iceberg #dbt #DataEngineering #SimplifiedSetup

Build a data lake Apache Iceberg and Apache Arrow | Build Data Lake | Open Source Tools | On-Premise

Build an End-to-End ETL Pipeline with Python & PostgreSQL

End to End Modern Distributed Data Lakehouse using Apache Iceberg, Trino, Airflow, DBT and Minio

70 SQL Interview Questions Practice Series - Part II

UiPath DevCon 2026 Highlights - Innovation in Action

Apache Iceberg: What It Is and Why Everyone’s Talking About It.

End-to-End E-Commerce Data Pipeline with Snowflake, dbt & Airflow | Delayed Orders Alterting

Create on premise Data Lakehouse with Apache Iceberg | Nessie | MinIO | Lakehouse

How do Time Series Databases Work?

Building an ingestion architecture for Apache Iceberg

AWS re:Invent 2023 - Netflix’s journey to an Apache Iceberg–only data lake (NFX306)

Understanding DuckLake: A Table Format with a Modern Architecture

How To Build An Open Data Lakehouse On Snowflake With Apache Iceberg

End to end ETL pipeline project using Docker, Airflow, PostgresDB and Metabase | Data Engineering

Apache Iceberg Deep Dive | Part 1 | Crash Course

The AI-native Data Engineer

Databricks End-To-End Project 2026 | Zero-To-Hero

I replaced my entire stack with Postgres...

Apache Airflow One Shot- Building End To End ETL Pipeline Using AirFlow And Astro

