Data Lake Fundamentals, Apache Iceberg and Parquet in 60 minutes on DataExpert.io

We'll be covering data lakes, the Parquet file format, data compression, and shuffle! Make sure you have a https://www.DataExpert.io account so you can get the most out of this lab!

Apache Iceberg: What It Is and Why Everyone’s Talking About It.

What Is A Parquet File? - Structure of Parquet - Encoding Optimizations

How Meta Models Big Volume Event Data - Full 4 Hour Course - DataExpert.io Free Boot Camp Week 2

Data Lake Modeling: 100 TBs into 5 TBs at Airbnb with Parquet + Run Length Encoding - DataExpert.io

Data Ecosystem Explained: Data Lake vs Delta Lake vs Lakehouse vs Data Warehouse

Apache Iceberg - More Than A Table Format | Distributed Systems Deep Dives With Ex-Google SWE

Dimensional data modeling and idempotent pipelines in 78 minutes with DataExpert.io

Parquet File Format - Explained to a 5 Year Old!

What is Apache Iceberg?

Apache Spark Was Hard Until I Learned These 30 Concepts!

An Extremely Technical Overview of How Apache Iceberg Planning Actually Works (Russell Spitzer)

Apache Iceberg vs Delta Lake – Which Open Table Format Should You Choose in 2025?

Spark + Iceberg in 1 Hour - Memory Tuning, Joins, Partition - Week 3 Day 1 - DataExpert.io Boot Camp

Apache Iceberg Deep Dive | Part 1 | Crash Course

Master Real-time Data Pipelines with Kafka and Flink - 3 hr Course - DataExpert.io Free Boot Camp

Data Warehouse vs Data Lake vs Data Lakehouse | ETL, OLAP vs OLTP

The Parquet Format and Performance Optimization Opportunities Boudewijn Braams (Databricks)

How a Netflix Side Project Became the Universal Standard for Data Tables

AWS re:Invent 2023 - Netflix’s journey to an Apache Iceberg–only data lake (NFX306)

End to End Modern Distributed Data Lakehouse using Apache Iceberg, Trino, Airflow, DBT and Minio