Parquet File Format - Explained to a 5 Year Old!

The Parquet file format has become a de facto standard for storing data. This video will teach you EVERYTHING you should know about the Parquet and Delta formats, in an easy-to-understand way.

https://data-mozart.com/parquet-file-...

00:00 - Introduction
02:05 - Why Parquet?
03:06 - Understanding row-based storage
04:06 - Understanding column-based storage
05:01 - Understanding Parquet storage with row groups
05:43 - Understanding projection and predicate(s)
07:30 - Performance tips and optimal file size
08:31 - Data compression in Parquet
10:04 - Understanding Delta format
