AI for data engineers with Simon Willison | Talking Postgres Ep30
It’s always a good day if you see a pelican. In Episode 30 of Talking Postgres with Claire Giordano, open source developer Simon Willison—creator of Datasette and co-creator of Django—joins to explore how AI is useful for data engineers today. We move past the hype and boosterism to dig into example after example: structured data extraction, alt text and accessibility, safety and security (aka the fiddly bits), and why Postgres’s fine-grained permissions are such a good fit for AI-powered workflows. Also: Pulitzer-worthy data tooling, the science fiction of the 10X engineer, agents, MCP, RAG, the multitude of models, and why Simon spends so many waking hours on the jagged frontier of AI. Chapters: ⏩ 00:00 Introducing Simon ⏩ 02:51 Commodore 64 ⏩ 03:49 Origin of Django ⏩ 11:21 Converging LLMs & data journalism ⏩ 12:52 Unreliable sources ⏩ 14:52 Stunningly good at SQL ⏩ 17:45 AI enables ambitious side projects ⏩ 20:33 Science fiction of 10x engineer? ⏩ 21:20 Art of using LLMs & spotting opportunities ⏩ 23:12 Accessibility and Gen AI podcast ⏩ 27:43 Structured data extraction ⏩ 31:20 Video input for Gemini models ⏩ 32:34 Biggest improvement in last 6 months ⏩ 35:29 Safety & security ⏩ 35:58 Postgres is fantastic for this ⏩ 39:50 AI terminology primer ⏩ 53:34 Monthly spend on LLMs ⏩ 54:17 Pelicans on bicycles ⏩ 1:03:55 Honeybadgering with GitHub Codespaces 📜 Full transcript available at: https://talkingpostgres.com/episodes/... ✅ Listen to more episodes of Talking Postgres: https://talkingpostgres.com 💥 Subscribe to Talking Postgres, so you never miss an episode: https://talkingpostgres.com/subscribe Links mentioned in this episode: 🔹 Blog: Simon Willison’s Weblog: https://simonwillison.net/ 🔹 Blog: Simon’s Willison’s TIL - Things I’ve Learned: https://til.simonwillison.net/ 🔹 Podcast episode: Ep01 of Talking Postgres with Simon Willison & Marco Slot: • Working in public on open source with Simo... 🔹 Django project: https://www.djangoproject.com/ 🔹 Datasette: https://datasette.io/ 🔹 GitHub repo for llm, a CLI tool and Python library: https://github.com/simonw/llm 🔹 Demo of llm CLI tool: • Language models on the command-line w/ Sim... 🔹 Blog post: OpenAI’s new open weight (Apache 2) models are really good, by Simon Willison: https://simonwillison.net/2025/Aug/5/... 🔹 Podcast episode: Accessibility and Gen AI with guest Simon Willison: • Ep 6 - Simon Willison - Creator, Datasette 🔹 Blog post: New dashboard: alt text for all my images, by Simon Willison: https://simonwillison.net/2025/Apr/28... 🔹 Keynote at Citus Con: An Event for Postgres 2023, by Simon Willison: • KEYNOTE: Big Opportunities in Small Data |... 🔹 Blog post: How OpenElections Uses LLMs, by Derek Willis: https://thescoop.org/archives/2025/06... 🔹 Blog posts tagged with pelican-riding-a-bicycle on Simon Willison’s Weblog: https://simonwillison.net/tags/pelica... 🔹 Blog post: “No, AI is not Making Engineers 10x as Productive” via Colton Voege: https://simonwillison.net/2025/Aug/6/... 🔹 GitHub repo for the pgvector extension to Postgres: https://github.com/pgvector/pgvector 🔹 Calendar invite: LIVE recording of Ep31 of Talking Postgres to happen on Wed Sep 17, 2025: https://aka.ms/TalkingPostgres-Ep31-cal #TalkingPostgres #podcast #PostgreSQL

What went wrong (& what went right) with AIO with Andres Freund | Talking Postgres Ep31

AI is coming for your job. Here’s what to do now, with Simon Willison | The Truth of the Matter

Software architecture, human judgment, and AI's limits with Grady Booch

Databricks Live Bootcamp | Day1: Introduction & Data Analytics

A week in privacy with Paul and K

Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

How I got started leading database teams with Shireesh Thota | Talking Postgres Ep29

AI Is Creating A Rare Opportunity For Investors. How Jim Roppel Is Playing It. | Investing With IBD

12 years of Postgres Weekly with Peter Cooper | Talking Postgres Ep28

Full Archon Guide - Build AI Coding Harnesses That Actually Ship (LIVE)

How to Start Coding | Programming for Beginners | Learn Coding | Intellipaat

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

Simon Willison: Using LLMs for Python Development | Real Python Podcast #236

Designing Data-Intensive Applications: Chapters 1 and 2

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Full App Building Course with Cursor (3+ Hours)

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Building a dev experience for Postgres in VS Code with Rob Emanuele | Talking Postgres Ep33

