Efficiently distributing Postgres with Citus – How to choose the right shard key? | Citus Con 2022

This video of Sai Srirampur’s talk at Citus Con: An Event for Postgres explains why the “Distribution Column” (or shard key) lies at the heart of the Citus superpower of distributed PostgreSQL tables. The shard key determines how data is distributed (aka sharded) in the database cluster and how efficiently your Postgres queries are executed by Citus. In this talk from the Americas livestream at Citus Con, Sai shares various criteria required to choose an optimal shard key for Citus. Some of the sharding key criteria include workload type, table sizes, cardinality of column(s), commonality of column(s) across tables etc. Sai walks through each of them using real-world examples and presents an organized approach you can follow to pick the right distribution column for your PostgreSQL database workload. Finally, the icing on the cake, Sai explores the possibilities and ideas for a tool which heuristically assesses the shard key criteria to automatically predict optimal shard key(s). Sai Srirampur leads the Customer Engineering team for the Citus extension to Postgres within Microsoft. Sai’s team is responsible for making technical onboarding easy for customers. This includes providing direct expert guidance to customers who are implementing Citus and building tools to make the database migration journey seamless. ► Video bookmarks: ⏩ 00:00 Introduction ⏩ 01:31 What is Citus? ⏩ 04:24 What is a distribution column & why is it important? ⏩ 09:22 3 steps to building highly scalable apps with Citus ⏩ 10:43 Multi-tenant SaaS apps ⏩ 15:40 Real-time operational analytics apps ⏩ 19:33 High throughput transactional apps ⏩ 22:14 Exploring automation of choosing the right shard key ⏩ 24:31 Q&A with Sai & Marco ✅ Learn more: Watch more Citus Con talks: https://aka.ms/cituscon-playlist 📕 Everything you need to know about Citus Con: An Event for Postgres can be found at: https://aka.ms/cituscon 📌 Let’s connect: Twitter – @CitusCon,   / cituscon   🔔 Subscribe to the Citus monthly technical newsletter: https://aka.ms/citus-newsletter #CitusCon #PostgreSQL #Citus

Explaining the Postgres Query Optimizer | Citus Con: An Event for Postgres 2022
▶︎

Explaining the Postgres Query Optimizer | Citus Con: An Event for Postgres 2022

Claire Giordano: Beginner's Guide to Partitioning vs. Sharding in Postgres (PGConf.EU 2023)
▶︎

Claire Giordano: Beginner's Guide to Partitioning vs. Sharding in Postgres (PGConf.EU 2023)

Lessons learned scaling our SaaS on Postgres to 8+ billion events | Citus Con 2022
▶︎

Lessons learned scaling our SaaS on Postgres to 8+ billion events | Citus Con 2022

Self Hosted Citus using Cloud Computing | Citus Con: An Event for Postgres 2023
▶︎

Self Hosted Citus using Cloud Computing | Citus Con: An Event for Postgres 2023

ClickHouse: what is behind the fastest open source columnar database
▶︎

ClickHouse: what is behind the fastest open source columnar database

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

The Challenges of Distributing Postgres: A Citus Story | Citus Data
▶︎

The Challenges of Distributing Postgres: A Citus Story | Citus Data

KEYNOTE: The Distributed PostgreSQL Problem & How Citus Solves it | Citus Con 2023
▶︎

KEYNOTE: The Distributed PostgreSQL Problem & How Citus Solves it | Citus Con 2023

Why Aliens Would NEVER Invade Africa
▶︎

Why Aliens Would NEVER Invade Africa

How Instagram Scaled Postgres to 2 Billion Users
▶︎

How Instagram Scaled Postgres to 2 Billion Users

Citus & Patroni: The Key to Scalable and Fault-Tolerant PostgreSQL | Citus Con 2023
▶︎

Citus & Patroni: The Key to Scalable and Fault-Tolerant PostgreSQL | Citus Con 2023

Citus from the Customer eyes | Citus Con: An Event for Postgres 2023
▶︎

Citus from the Customer eyes | Citus Con: An Event for Postgres 2023

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit
▶︎

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Citus: Distributed PostgreSQL as an Extension (Microsoft Citus Data)
▶︎

Citus: Distributed PostgreSQL as an Extension (Microsoft Citus Data)

🚗 BYD : The biggest SCAM of the car industry ?
▶︎

🚗 BYD : The biggest SCAM of the car industry ?

What is Databricks? The Story Behind the Modern Data Platform (Visual Explanation)
▶︎

What is Databricks? The Story Behind the Modern Data Platform (Visual Explanation)

When You Try To Be Microsoft…The Fall Of Salesforce
▶︎

When You Try To Be Microsoft…The Fall Of Salesforce

How to scale Postgres for time series data with Citus | Citus Con: An Event for Postgres 2022
▶︎

How to scale Postgres for time series data with Citus | Citus Con: An Event for Postgres 2022

DEMO 10X Faster Analytics Queries on Postgres with Citus
▶︎

DEMO 10X Faster Analytics Queries on Postgres with Citus

Kafka Tutorial for Beginners | Everything you need to get started
▶︎

Kafka Tutorial for Beginners | Everything you need to get started