Storing State Forever: Why It Can Be Good For Your Analytics
State is an essential part of the modern streaming pipelines: it enables a variety of foundational capabilities like windowing, aggregation, enrichment, etc. But usually, the state is either transient, so we only keep it until the window is closed, or it's fairly small and doesn't grow much. But what if we treat the state differently? The keyed state in Flink can be scaled vertically and horizontally, it's reliable and fault-tolerant... so is scaling a stateful Flink application that different from scaling any data store like Kafka or MySQL? At Shopify, we've worked on a massive analytical data pipeline that's needed to support complex streaming joins and correctly handle arbitrarily late-arriving data. We came up with an idea to never clear state and support joins this way. We've made a successful proof of concept, ingested all historical transactional Shopify data and ended up storing more than 10 TB of Flink state. In the end, it allowed us to achieve 100% data correctness.

KEYNOTE: Apache Flink in the Cloud-native Era

Paper Moon: A Comparison of Stateful Functions and Pulsar Functions

"A.I. and Our Economic Future," Professor Chad Jones

The Reason I Didn't Buy SpaceX IPO — And Why I'm Holding $397 BILLION Instead | WARREN BUFFETT

The Most Misunderstood Concept in Physics

Flink Agents: The Agentic AI Framework based on Apache Flink

Apache Fluss and the Seven Deadly Sins of Streaming

Apache Fluss: Making Your Lakehouse Truly Real Time

Log Normalization The Art of Providing Detection Ready Data Szilárd Parrag

Unbelievable Workers | Working with Talented Engineers #46 #fail #adamrose #smartworkers

Stanford Luck Researcher: How to Manifest the Life You Want

I turned an old van into a 2-STORY tiny house

How To Think SO CLEARLY People Assume You're A Genius

Streaming Down the Fluss: Taming CDC Streams with Flink, Fluss, and Paimon

Flink State Management: A Journey from Core Primitives to Next-Generation Incremental Computation

Navigating Workplace Politics: A Guide for Leaders with Dr. Rick Brandon & Charles Good | TGLP #55

Enabling Apache Flink Management Platforms as a Service in the AI Stack

Flink Forward Barcelona 2025: Opening Session and Keynotes

