Antoine Pitrou - Apache Parquet: the standard, efficient file format for tabular data

Session description: As a developer, scientist or analyst dealing with multi-column data, you have probably encountered the Apache Parquet file format. But do you know how it works? Is it just a "binary CSV"? This talk will explain exactly what a Parquet file contains and how it represents your data. It will help you understand the strengths of the Parquet format, its limitations, and how to make the best use of it.

Website: https://oredev.org