Scaling Retrieval-Augmented Generation in Production using Semantic Caching
Sravani Lingam presents the talk "Scaling Retrieval-Augmented Generation in Production using Semantic Caching" at the 2026 Applied Machine Learning Conference in Charlottesville, Virginia. For more information, please see the session page on the conference website: https://appliedml.us/2026/sessions/sc...

