Vector Databases: Embeddings, Semantic Search, and Hybrid Retrieval - Alexey Grigorev
In this workshop, Alexey Grigorev, founder of DataTalks.Club, dives deep into the technical shift from lexical to semantic retrieval in modern AI applications. We explore the evolution of RAG pipelines, specifically moving from basic text search to advanced vector search architectures and production-ready deployments. Links: LLM Zoomcamp: https://github.com/DataTalksClub/llm-... Materials: https://github.com/DataTalksClub/llm-... Guardrails workshop: https://aishippinglabs.com/workshops/... You’ll learn about: Why meaning-based search beats keyword matching How to shift from lexical to semantic retrieval in RAG pipelines How to implement vector search with PGVector and SQLite How to use ONNX to make AI models 33x lighter and faster How to manage large datasets with Approximate Nearest Neighbors (ANN) How to automatically filter irrelevant or out-of-scope queries A clear guide to Embeddings, Cosine Similarity, and Scoring TIMECODES: 0:00 Evolution from Text Search to Vector Search 7:33 RAG Architecture and Prompt Engineering with Context 11:26 Environment Setup for Sentence Transformers and PyTorch 16:45 Fundamentals of Word and Sentence Embeddings 20:09 Utilizing Sentence-BERT for Semantic Similarity 25:16 Mathematics of Vector Similarity and Cosine Similarity 28:17 FAQ Dataset Embedding and Batch Processing Workflows 33:53 High-Performance Matrix Multiplication for Similarity Scoring 38:18 Implementing Top-K Search and Argmax using NumPy 43:23 MinSearch Integration and Metadata Filtering Strategies 48:54 Hybrid Search Advantages and Lexical-Semantic Integration 52:30 Refactoring RAG Pipelines for Vector Search Swapping 57:59 Benchmarking and Justification for Vector Search Deployment 1:02:47 Approximate Nearest Neighbors vs Exact Search Performance 1:07:01 Decoupling Data Ingestion from Production Deployment 1:12:44 Vector Index Implementation with SQLite Search 1:18:48 Production Vector Search using PGVector and Postgres 1:23:44 LLM Guardrails and Filtering Irrelevant User Queries 1:35:13 ONNX Model Optimization for Lightweight Production AI Connect with DataTalks.Club: Join the community - https://datatalks.club/slack.html Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/... Check other upcoming events - https://lu.ma/dtc-events GitHub: https://github.com/DataTalksClub LinkedIn - / datatalks-club Twitter - / datatalksclub Website - https://datatalks.club/ Connect with Alexey Twitter - / al_grigor Linkedin - / agrigorev Check our free online courses: ML Engineering course - http://mlzoomcamp.com Data Engineering course - https://github.com/DataTalksClub/data... MLOps course - https://github.com/DataTalksClub/mlop... LLM course - https://github.com/DataTalksClub/llm-... Open-source LLM course: https://github.com/DataTalksClub/open... AI Dev Tools course: https://github.com/DataTalksClub/ai-d... 👉🏼 Read about all our courses in one place - https://datatalks.club/blog/guide-to-... 👋🏼 Support/inquiries If you want to support our community, use this link - https://github.com/sponsors/alexeygri... If you’re a company, reach us at [email protected] ag #vectorsearch #semanticsearch #llmzoomcamp #aiengineering #nlp #embeddings #pgvector #postgres #sqlite #onnx #aioptimization #machinelearning #generativeai #datatalksclub #python #searchinfrastructure #cosinesimilarity #hybridsearch #semanticretrieval

From RAG to AI Agents: Function Calling and Tool Use - Alexey Grigorev

PyFlink Stream Processing Tutorial: Build a Real-Time Pipeline with Kafka, Redpanda and Python

AWS Tech & AI Essentials — Build a real AI workflow - Day 5 of 5

Is RAG Still Needed? Choosing the Best Approach for LLMs

Understanding the AI Engineer Role - Nasser Qadri

How to Evaluate MCP-powered AI Agents Beyond Accuracy using Agent GPA - Josh Reini

The Agent Development Lifecycle: Build, Test, Deploy, Monitor | Interrupt 26

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

What is SonarQube | Introduction SonarQube | SonarQube Tutorial | SonarQube Basics | Intellipaat
![Microsoft Fabric and Power BI - Developer of the Future⚡ [Full Course]](https://i.ytimg.com/vi/ohKpl80obzU/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLC7OUcS43Tjw7PcWR1n6T-ncrgsdA)
Microsoft Fabric and Power BI - Developer of the Future⚡ [Full Course]

Build Your First RAG Application with LLMs - Alexey Grigorev

Practical AI Coding Agent Evaluation with SWE-bench, TeamCity, and Juni | Ernst Haagsman

Build a Complete Medical Chatbot with LLMs, LangChain, Pinecone, Flask & AWS 🔥

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Starting a Data Conference: The Data Makers Fest Story - Leonid Kholkine

RAG and Agents Evaluation: Measuring Retrieval and LLM Answer Quality - Alexey Grigorev

Object Oriented Programming | OOPS in Python | OOPS Tutorial | Intellipaat

LLM Zoomcamp 2026 Pre-Course Live Q&A - Alexey Grigorev

The Complete Guide to Hybrid Search in RAG (BM25 + Embeddings + Reranker)

