How to make LLMs fast: KV Caching, Speculative Decoding, and Multi-Query Attention | Cursor Team
Lex Fridman Podcast full episode: Cursor Team: Future of Programming with AI...

GUEST BIO:
Aman Sanger, Arvid Lunnemark, Michael Truell, and Sualeh Asif are creators of Cursor, a popular code editor that specializes in AI-assisted programming.

EPISODE LINKS:
Cursor Website: https://cursor.com
Cursor on X: https://x.com/cursor_ai
Anysphere Website: https://anysphere.inc/
Aman's X: https://x.com/amanrsanger
Aman's Website: https://amansanger.com/
Arvid's X: https://x.com/ArVID220u
Arvid's Website: https://arvid.xyz/
Michael's Website: https://mntruell.com/
Michael's LinkedIn: https://bit.ly/3zIDkPN
Sualeh's X: https://x.com/sualehasif996
Sualeh's Website: https://sualehasif.me/

PODCAST LINKS:
Podcast Website: https://lexfridman.com/podcast
Apple Podcasts: https://apple.co/2lwqZIr
Spotify: https://spoti.fi/2nEwCF8
RSS: https://lexfridman.com/feed/podcast/

Cursor Team: Future of Programming with AI | Lex Fridman Podcast #447


