Analytics for not-so-big data with DuckDB - David Ostrovsky - NDC Oslo 2025
This talk was recorded at NDC Oslo in Oslo, Norway. #ndcoslo #ndcconferences #developer #softwaredeveloper Attend the next NDC conference near you: https://ndcconferences.com https://ndcoslo.com/ Subscribe to our YouTube channel and learn every day: / @NDC Follow our Social Media! / ndcconferences / ndc_conferences / ndc_conferences #database #architecture #bigdata #python #csv #sql In the past decade the industry has seen hundreds of new databases. Most of these newcomers are operational databases, meant for online workloads and being a primary datastore for applications. A handful of new databases are meant for analytical use-cases, mainly large scale big data workloads. Which makes DuckDB an interesting exception, because it's built for workloads that are too big for traditional databases, but not so big that they justify complicated big data tools. It's a lightweight, open-source, analytical database for people with gigabytes or single terabytes of data, not companies with hundreds of terabytes and teams of data engineers. In this session we'll take DuckDB out for a test drive with live demos and discussion of interesting use-cases. We'll see how to use it to quickly run analytical queries on data from multiple data sources. We'll look at how to use DuckDB to transform and manipulate diverse datasets, such as turning a bunch of raw CSV data in S3 into a set of tables in MySQL with a single command. We'll check out its embedded capabilities, by running the database directly inside a Python application. And finally, we'll build a quick-and-dirty Data Lake by using DuckDB, without any complicated big data tools.

How DuckLake Simplifies Lakehouse Architecture ft. Jordan Tigani & Hannes Mühleisen

DuckDB + PostgreSQL: Building a Data Warehouse Inside Your Favorite Database

Let’s Get Certified: DP-800 | Session 2 | Securing, Optimizing, and Deploying Database Solutions

Understanding Big Data for Software Engineers - David Ostrovsky - NDC Oslo 2023

Designing Data-Intensive Applications: Chapters 1 and 2

DuckDB, Apache Arrow, & the Future of Data Engineering w/ Rusty Conover | S2E3

Building a Postgres Data Warehouse using DuckDB

How To Use `constexpr` In C++23 - Jason Turner - NDC TechTown 2024

DuckDB: How to Build 100x Faster Analytics Databases (with Co-Creator Hannes Mühleisen)

Scaling DuckDB in the cloud with MotherDuck CEO Jordan Tigani

DuckDB Co-Creator Hannes Mühleisen on Why Single-Node Beats Distributed

What is DuckDB? The Database Changing Data Engineering Forever

Can DuckDB Replace Your Data Stack? (MotherDuck Co-Founder Ryan Boyd)

Modern Architecture 101 for New Engineers & Forgetful Experts - Jerry Nixon - NDC Copenhagen 2025

A perfect match: Dapr & Azure Container Apps - Sander Molenkamp - NDC London 2023

DuckLake Deep Dive: Build a Full Lakehouse with Just Parquet Files and DuckDB

DuckDB, AI, and the Future of Data Engineering | with Staff Engineer, Matt Martin

Xavier Fischer: Deep dive into the PostgreSQL FrontEnd/BackEnd protocol (PGDay Armenia 2026)

Gábor Szárnyas - DuckDB: The Power of a Data Warehouse in your Python Process

