InfluxDB Storage Engine Internals | Metamarkets
Recorded at DataEngConf SF '17 InfluxDB is an open source time series database developed over the last 3 years. In that time we've tried different storage engines starting with LevelDB and testing out HyperLevelDB, RocksDB and BoltDB. Over a year ago we made the decision to write our own storage engine from scratch. Inspired by the LSM Tree underlying LevelDB and its variants, we created a new storage engine we're calling the TSM Tree (Time Structured Merge Tree). Over the last eight months we've added to this storage engine to provide index capabilities for mapping metadata to underlying time series. This talk will briefly cover our journey with other storage engines and why we ultimately decided to write our own from scratch. The underlying InfluxDB storage engine is more like two storage engines in one: a time series storage engine and an inverted index for metadata. This talk will dive into the details about how each of these systems work, their design considerations and lessons learned along the way. We'll cover compression techniques for columnar time series storage, Robin Hood Hashing for quickly index lookups, and sketches for estimation of series cardinality at scale. Speaker: Paul Dix, Metamarkets ABOUT DATA COUNCIL: Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers. Make sure to subscribe to our channel for more videos, including DC_THURS, our series of live online interviews with leading data professionals from top open source projects and startups. FOLLOW DATA COUNCIL: Twitter: / datacouncilai LinkedIn: / datacouncil-ai Facebook: / datacouncilai Eventbrite: https://www.eventbrite.com/o/data-cou... - 🎟️ GET YOUR TICKET TO AI COUNCIL 2026 🎟️ Meet the world's top AI infrastructure minds where architects of AI share what works. Three days of high-quality technical talks and meaningful interactions. → https://aicouncil.com/sf-2026 ⚡ FIND US: X: https://x.com/AICouncilConf LinkedIn: / aicouncilconf Website: https://aicouncil.com/

CockroachDB: Architecture of a Geo-Distributed SQL Database | Cockroach Labs

Anomaly Detection for Data Quality and Metric Shifts at Netflix | Netflix

Time Series Database Lectures #1 - Paul Dix (InfluxDB)

RocksDB: A High Performance Embedded Key-Value Store for Flash Storage - Data@Scale

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Scaling Write-Intensive Key-Value Stores

Rearchitecting a SQL Database for Time-Series Data | TimescaleDB
![Michael Desa [InfluxData] | InfluxDB 101 – Concepts and Architecture | InfluxDays London 2019](https://i.ytimg.com/vi/S1kuOyS8FHY/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLBj88FpP3Axju3HMGpmsS8xtKWzZw)
Michael Desa [InfluxData] | InfluxDB 101 – Concepts and Architecture | InfluxDays London 2019

Algorithms behind Modern Storage Systems

Something is jamming GPS over Europe. Here's what we found

Recursive Self Improvement

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Jfrog | Jfrog Artifactory | Jfrog Artifactory Tutorial | Artifactory Tutorial | Intellipaat

DropBox Engineering Evening on RocksDB with Dhruba Borthakur @ Rockset

Kafka: A Modern Distributed System
![D. Richard Hipp - SQLite [The Databaseology Lectures - CMU Fall 2015]](https://i.ytimg.com/vi/gpxnbly9bz4/hqdefault.jpg?sqp=-oaymwE9CNACELwBSFryq4qpAy8IARUAAAAAGAElAADIQj0AgKJDeAHwAQH4Af4JgALQBYoCDAgAEAEYZSBlKGUwDw==&rs=AOn4CLBg9t3UnpojWFsIRtAkZFfn3fTVHA)
D. Richard Hipp - SQLite [The Databaseology Lectures - CMU Fall 2015]

System Design Concepts Course and Interview Prep

Nathan Bronson: Improving RocksDB Write Scalability

How Superset and Druid Power Real-Time Analytics at Airbnb | DataEngConf SF '17

