Moving on from RocksDB to something FASTER - Matthew Brookes
For many years streaming applications requiring larger-than-memory fault-tolerant state have settled for RocksDB as the de facto state backend. This is despite it being optimised for read and range queries rather than the update intensive workloads typically exhibited in stream processing. Several features of RocksDB’s design, such as its key-order page format and Read-Copy-Update approach, become limiting factors in the throughput of state updates. Given these limitations, we have evaluated the use of FASTER, an embedded Key-Value store from Microsoft Research, as an alternative backend that is more suitable for streaming workloads. It uses in-place updates on a changeable “hot” set in-memory and a cache-optimised hash index to ensure a high throughput of point operations on its HybridLog that spans memory and disk. In this talk we present benchmarking results for different streaming workloads highlighting the performance differences between FASTER and RocksDB. We use these results to motivate an integration between FASTER and Timely Dataflow, with promising results demonstrating FASTER’s suitability as the state backend of choice for large stateful computations. Finally, we will show the early results from the integration of FASTER with Flink.

Apache Flink Worst Practices - Konstantin Knauf

Speedb/RocksDB - The Rise of LSM-Trees, Why Now?

FASTER: Efficient State Management for the Modern Edge-Cloud (Badrish Chandramouli)

RocksDB: A High Performance Embedded Key-Value Store for Flash Storage - Data@Scale

Massive Scale Data Processing at Netflix using Flink - Snehal Nagmote & Pallavi Phadnis

When You Think You're Microsoft…The Fall Of Salesforce

Managing State in Apache Flink - Tzu-Li (Gordon) Tai

The Stockholm Syndrome of SQL | Prime Reacts

Accelerating Performance: Mastering RocksDB for High Speed Data Processing by Varunkumar Nagarajan

InfluxDB Storage Engine Internals | Metamarkets

Flink's Table & DataStream API: A Perfect Symbiosis

Apache Flink 101 | Building and Running Streaming Applications

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Architecting a Low-Latency Schemaless SQL Engine | Rockset

Storing State Forever: Why It Can Be Good For Your Analytics

Webinar: Deep Dive on Apache Flink State - Seth Wiesman

Something is jamming GPS over Europe. Here's what we found

Robust Stream Processing with Apache Flink

Introduction to Stateful Stream Processing with Apache Flink • Robert Metzger • GOTO 2019

