How to Stop Your AI from Making Things Up (RAG)

Every LLM hallucinates. ChatGPT, Claude, Gemini , doesn't matter how big the model. Ask it about your private docs or your company's policies and it will confidently invent an answer that sounds correct and is completely wrong. The fix isn't a smarter model. The fix is RAG , the system that forces your LLM to retrieve real context before it generates a single word. This session is the full breakdown. This is Week 7 of TAI's 12-week AI Engineering cohort. Our instructor Dr Akshika (Aki) Wijesundara walks through the full RAG mental model: chunking strategies, retrieval, re-ranking, the live build, and the eval metrics nobody talks about. By the end you'll know exactly what every part of a RAG pipeline does and which decisions actually move the needle. 🎓 JOIN THE NEXT COHORT → https://theaiinternship.com/ • 12 weeks, live mentor-led sessions • Build real projects: RAG, agents, fine-tuning, MCP, capstone • Small batches, direct access to instructors • Career support + portfolio reviews ━━━━━━━━━━━━━━━━━━━━━ What you'll understand in 47 minutes: ✓ Why every LLM hallucinates (and why fine-tuning doesn't fix it) ✓ The full RAG pipeline: chunk → embed → store → retrieve → re-rank → generate ✓ Five chunking strategies and when to pick each (fixed / document / semantic / recursive / agentic) ✓ Pre-chunking vs post-chunking — and why it matters ✓ Live demo: building a working RAG pipeline from scratch ✓ The eval metrics most tutorials skip (hit@K, NDCG, BLEU, exact match) ✓ How to safeguard your LLM from bad queries ✓ Tracking + observability with Langfuse 🕐 CHAPTERS 0:00 Welcome — what is RAG? 2:33 RAG = Retrieval Augmented Generation 3:03 Why LLMs hallucinate without context 4:42 The RAG workflow, high-level 8:51 The full RAG pipeline 9:11 Chunking — your first big decision 13:44 Three chunking strategies (fixed / document / semantic) 17:06 Pre-chunking vs post-chunking 18:21 Recursive + hierarchical chunking 19:52 LLM-based + agentic chunking 22:23 Parent + child chunking 24:14 Retrieval — the second pillar 26:03 Live demo — building a RAG pipeline 28:46 Generating embeddings 29:55 Creating the vector database 33:48 Retrieval step deep-dive 35:51 The RAG pipeline object 37:59 Why RAG is hard — it's probabilistic 38:40 RAG evaluation: hit@K, NDCG, BLEU 43:09 Safeguarding your LLM from bad queries 44:19 This week's homework 45:36 Q&A — Langfuse + tracking 46:53 Wrap 🛠️ TOOLS MENTIONED • Weaviate — https://weaviate.io • Pinecone — https://pinecone.io • Langfuse (tracking + eval) — https://langfuse.com • LangChain RAG — https://python.langchain.com/docs/tut... • sentence-transformers — https://www.sbert.net • OpenAI Embeddings — https://platform.openai.com/docs/guid... ━━━━━━━━━━━━━━━━━━━━━ 📚 THE FULL CURRICULUM W01 — Environment Setup & Your First OpenAI Call W02 — Build a ChatGPT Clone (Two Ways) W03 — REST APIs, JWT & FastAPI (Unhackable Backend) W04 — Vector Embeddings & Semantic Search W05 — Fine-Tuning ChatGPT (Turn It Into Your Company Intern) W08 — LangChain & LangGraph W09 — Build an MCP Server W11–W12 — Capstone project + showcase → Apply: https://theaiinternship.com/ About TAI — The AI Internship We train engineers to ship real AI products. 12-week mentor-led cohorts, real codebases, real deployment, real career outcomes. #RAG #AIHallucination #LLM #RetrievalAugmentedGeneration #AIEngineering #VectorDatabase #ChatGPT #TheAIInternship #BuildWithAI #PromptEngineering

Is RAG Still Needed? Choosing the Best Approach for LLMs

Is RAG Still Needed? Choosing the Best Approach for LLMs

Is AI Hiding Its Full Power? With Geoffrey Hinton

Is AI Hiding Its Full Power? With Geoffrey Hinton

How AI agents & Claude skills work (Clearly Explained)

How AI agents & Claude skills work (Clearly Explained)

Your Agent Should Start Before You Ask

Your Agent Should Start Before You Ask

Feed Your OWN Documents to a Local Large Language Model!

Feed Your OWN Documents to a Local Large Language Model!

Learn 97% of Claude in Under 16 Minutes

Learn 97% of Claude in Under 16 Minutes

You Can Be an AI Engineer in 35 Minutes (Here's How)

You Can Be an AI Engineer in 35 Minutes (Here's How)

Harnesses in AI: A Deep Dive — Tejas Kumar, IBM

Harnesses in AI: A Deep Dive — Tejas Kumar, IBM

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Mythos 5 new ability is AGI...

Mythos 5 new ability is AGI...

CLI vs MCP: How AI Agents Choose the Right Tool for the Job

CLI vs MCP: How AI Agents Choose the Right Tool for the Job

Don't learn AI Agents without Learning these Fundamentals

Don't learn AI Agents without Learning these Fundamentals

Exposing The Solid State Donut Battery. It's Over.

Exposing The Solid State Donut Battery. It's Over.

We Tested Anthropic’s Fable 5 for a Week

We Tested Anthropic’s Fable 5 for a Week

The AI PM Playbook: Context, Evals, Cost & Shipping

The AI PM Playbook: Context, Evals, Cost & Shipping

RAG vs Fine-Tuning vs Prompt Engineering: Optimizing AI Models

RAG vs Fine-Tuning vs Prompt Engineering: Optimizing AI Models

Anthropic Workshop: Build Agents That Run for Hours — Ash Prabaker & Andrew Wilson

Anthropic Workshop: Build Agents That Run for Hours — Ash Prabaker & Andrew Wilson

RAG Crash Course for Beginners

RAG Crash Course for Beginners

AI Tools Every Product Manager Should Know in 2026

AI Tools Every Product Manager Should Know in 2026