How to Choose a Vector Database
Noé Achache of Theodo - Data & AI joins us to present How to Choose a Vector Database in 2023. Noé explores the evolving landscape of vector databases in the context of rising interest in LLMs and Generative AI. He offers a comparison of various vector databases, advising readers on choosing between integrated vector search tools like PGVector and knn search for existing databases versus dedicated vector databases such as Pinecone, Qdrant, Weaviate, Milvus, and ChromaDB, for cost and latency concerns. The discussion covers indexing algorithms, emphasizing HNSW and IVF, followed by an in-depth comparison of the vector databases. Finally, a practical example of using a vector database with DVC will be shown, to iterate on your vectors while using the same stack as in your production pipeline. At the end of the presentation, you should have more clarity on selecting the right vector database based on individual requirements and technical infrastructure. C.f. this article from Noé for a first taste of the talk: https://www.sicara.fr/blog-technique/... Link to slides: https://docs.google.com/presentation/... Additional content from Noé - TextBoxGan: G enerating text boxes to train OCRs with a GAN • TextBoxGan: G enerating text boxes to trai... Learn more about Sicara here: https://www.sicara.fr/en/ Try out the DVC Extension for VS Code here: https://marketplace.visualstudio.com/... To learn more about Iterative's open-source and SaaS tools please visit: 🧑🏽💻 Our free online course: https://learn.iterative.ai ✍🏼 Our docs: https://dvc.org/doc (Data Version Control, Pipelines, Experiments) https://cml.dev/doc (CI/CD for Machine Learning) https://mlem.ai/doc (Package and Serve your models) https://studio.iterative.ai (Team Collaboration, Experiments, Model Registry) Join the Community on our Discord server: / discord #dvc #machinelearning #datascience #generativeai

Empowering Fast and Reproducible Machine Learning - Denis Stalz-John at Dida Conference 2023

Great Practices for Retrieval Augmented Generation (RAG) in Production

How to Create Scalable and Distributed Workflows with DVC and Ray

The Future of IT Ops: AI-Driven Tools You Can Build Today by Pierre Roman

What is a Vector Database? Powering Semantic Search & AI Applications

Something is jamming GPS over Europe. Here's what we found

More Designs, Same Standards by Remi Denoyer , Lead Data Scientist, Behaviorally

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Deutschland – Curaçao Highlights | Gruppe E, FIFA WM 2026 | sportstudio

The World's Most Important Machine

GraphRAG: The Marriage of Knowledge Graphs and RAG: Emil Eifrem

Computer Vision Data Annotation & Preparation Using DVCx

How ASML Makes Chips Faster With Its New $400 Million High NA Machine

Achieving Production-level Performance in RAG with DSPy, Parea, and DVC

Leading in the Age of AI: A Conversation with NVIDIA CEO Jensen Huang | Global Conference 2026

DataChain Open-Source Release - A new way to manage your Unstructured Data

How AI agents & Claude skills work (Clearly Explained)

