Azure Synapse Analytics | Data Distribution Strategy and Best Practices

In any distributed system, distributing data evenly across nodes and colocating related data are both important for efficient parallel processing and good performance. In this video, in the context of the Azure Synapse Analytics dedicated SQL pool, I am going to walk you through data distribution strategy: what distributions are, the round robin and hash distribution strategies, and finally replicated tables. I also cover best practices, with prescriptive guidelines on when to use which option and what to consider for better performance.

0:00 Introduction of distributed system and data distribution
7:14 Table types in SQL pools
8:40 Round Robin Distribution - Introduction
13:18 Hash Distribution - Introduction
16:42 Concept of distribution and how it maps to compute nodes
22:48 Round Robin Vs Hash - Example and performance differences
35:51 Round Robin Vs Hash - Analyze execution plans
42:52 Round Robin Vs Hash - Join Compatibility
49:30 Hash Distribution - Data skewness
51:51 Round Robin - Best Practices and Guidelines
53:58 Hash Distributed - Best Practices and Guidelines
59:17 Replicated Table - Introduction, Best Practices and Guidelines
1:03:24 Replicated Table - Example

Thank you for watching. In my next video, I am going to talk in detail about the columnstore index and how it helps improve performance for analytical queries. Stay tuned.

GitHub Repo to download deck and script used in the video:
https://github.com/AasTrailblazers/Az...

Sample Databases
https://docs.microsoft.com/en-us/azur...

Table Design
https://docs.microsoft.com/en-us/azur...
https://docs.microsoft.com/en-us/azur...
https://docs.microsoft.com/en-us/azur...

Memory and concurrency limits
https://docs.microsoft.com/en-us/azur...
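To make the round robin vs hash comparison from the video a little more concrete, here is a small Python sketch. It is not the script from the video or the engine's actual algorithm: the crc32 hash, the sample rows, and the function names (round_robin, hash_distribute, skew) are all assumptions chosen for illustration. The only fact taken from the product is that a dedicated SQL pool spreads each table over 60 distributions.

```python
import zlib
from collections import Counter

# A dedicated SQL pool spreads every table over 60 distributions.
NUM_DISTRIBUTIONS = 60


def round_robin(rows):
    """Assign rows to distributions in turn, ignoring row content.

    Simplified model: the real engine does not promise this exact order.
    """
    return [i % NUM_DISTRIBUTIONS for i, _ in enumerate(rows)]


def hash_distribute(rows, key):
    """Assign each row by a deterministic hash of its distribution column.

    crc32 is only a stand-in; the engine's hash function is different.
    """
    return [
        zlib.crc32(str(row[key]).encode()) % NUM_DISTRIBUTIONS
        for row in rows
    ]


def skew(assignments):
    """Ratio of the largest distribution to the average size (1.0 = even)."""
    counts = Counter(assignments)
    sizes = [counts.get(d, 0) for d in range(NUM_DISTRIBUTIONS)]
    return max(sizes) / (sum(sizes) / NUM_DISTRIBUTIONS)


# Hypothetical sample data: 6000 orders, 90% of them from one country.
rows = [
    {"order_id": i, "country": "US" if i % 10 else "NZ"}
    for i in range(6000)
]

rr = round_robin(rows)
by_order = hash_distribute(rows, "order_id")   # many distinct values
by_country = hash_distribute(rows, "country")  # only two distinct values

print("round robin skew:     ", skew(rr))
print("hash(order_id) skew:  ", skew(by_order))
print("hash(country) skew:   ", skew(by_country))
```

Under these assumptions, round robin is perfectly even but places rows with the same key on different distributions, so joins on that key need data movement. Hashing on a column with many distinct values keeps equal keys together while staying reasonably even, and hashing on a low-cardinality column such as country piles most rows onto one or two distributions, which is the data skew problem discussed at 49:30.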

Azure Synapse Analytics | Index Options | Columnstore Index | Best Practices

Azure Synapse Analytics | Data Ingestion Patterns, Polybase, and Copy Command

Azure Synapse Analytics Tutorial (From Zero to Pro) | Azure Data Engineering

Azure Synapse Analytics | Continuous Integration and Continuous Delivery (CI/CD)

Azure Synapse Analytics | Table Partition | Best Practices

Azure Synapse | Azure Synapse Analytics [Full Course] ☁️

Azure Synapse Analytics | Workload Management and Concurrency

Azure Data Factory, Azure Databricks, or Azure Synapse Analytics? When to use what.

André Kamman: Synapse Analytics Dedicated SQL Pool Best Practices

Azure Synapse Analytics Networking

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Azure SQL Database: Columnstore indexes | Azure Friday

Azure Synapse Analytics | Spark pool | Delta Lake - Part 1

Introduction to Data Integration in Azure Synapse Analytics - Cathrine Wilhelmsen

Databricks Live Bootcamp | Day1: Introduction & Data Analytics

Azure Synapse Analytics: A Data Lakehouse - James Serra - PASS Virtual Summit 2020

Azure Synapse Analytics | Introduction and Getting Started

What Nobody Tells You About Being a Quant

SQL Indexes (Visually Explained) | Clustered vs Nonclustered | #SQL Course 35

Tiago Balabuch & Filipa Lobão: How to tune Azure Synapse Analytics SQL Dedicated Pool performance
