Lessons from the Field:Applying Best Practices to Your Apache Spark Applications with Silvio Fiorito
Apache Spark is an excellent tool to accelerate your analytics, whether you’re doing ETL, Machine Learning, or Data Warehousing. However, to really make the most of Spark it pays to understand best practices for data storage, file formats, and query optimization. As a follow-up of last year’s “Lessons From The Field”, this session will review some common anti-patterns I’ve seen in the field that could introduce performance or stability issues to your Spark jobs. We’ll look at ways of better understanding your Spark jobs and identifying solutions to these anti-patterns to help you write better performing and more stable applications. About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business. Read more here: https://databricks.com/product/unifie... Connect with us: Website: https://databricks.com Facebook: / databricksinc Twitter: / databricks LinkedIn: / databricks Instagram: / databricksinc Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. https://databricks.com/databricks-nam...

Tuning and Debugging Apache Spark

From Basic to Advanced Aggregate Operators in Apache Spark SQL 2 2 by Examples and their Catalyst Op

Patrick Henry -Serial Entrepreneur & CEO @ Oculi

Making Apache Spark™ Better with Delta Lake

A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets - Jules Damji

The Parquet Format and Performance Optimization Opportunities Boudewijn Braams (Databricks)
![Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]](https://i.ytimg.com/vi/X48VuDVv0do/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDNg7nINwKqigXGqrL80FN9YuTNGg)
Kubernetes Tutorial for Beginners [FULL COURSE in 4 Hours]

Apache Spark Performance Troubleshooting at Scale, Challenges, Tools, and Methods with Luca Canali

Spark + Parquet In Depth: Spark Summit East talk by: Emily Curtin and Robbie Strickland

Designing Data-intensive Applications with Martin Kleppmann
![SQL Course for Beginners [Full Course]](https://i.ytimg.com/vi/7S_tz1z_5bA/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLCAEolqW9nvnTsvv0q31O_tNsNdIw)
SQL Course for Beginners [Full Course]

Real-Time Data Pipelines Made Easy with Structured Streaming in Apache Spark | Databricks

SparkSQL: A Compiler from Queries to RDDs: Spark Summit East talk by Sameer Agarwal

Free Event: Power BI Beginner to Pro 2026 Edition - Full Hands-On Tutorial

Tricks of the Trade to be an Apache Spark Rock Star - Ted Malaska

The columnar roadmap: Apache Parquet and Apache Arrow

Complete Terraform Course - From BEGINNER to PRO! (Learn Infrastructure as Code)

Deep Dive: Apache Spark Memory Management

How to Automate Performance Tuning for Apache Spark -Jean Yves Stephan (Data Mechanics)

