Pinterest's ML Evolution: Distributed Training with Ray | Ray Summit 2024
Pinterest's success hinges on its advanced recommender systems, which power key products like Homefeed, Related Pins, and Ads. In this session, Saurabh Vishwas Joshi and Jiun-Yu Lee from Pinterest's ML Platform team reveal how they've harnessed Ray to optimize data loading for their data-intensive recommender model training. The presenters detail Pinterest's journey in leveraging Ray and the Ray Data ecosystem to decompose and orchestrate massive workloads across thousands of training jobs. They discuss the challenges faced, including memory pinning and multi-threaded collate, and the innovative solutions implemented to scale data-loading beyond trainer nodes. Learn how these optimizations have significantly boosted training throughput and how Pinterest is addressing new challenges through internal abstractions and open-source contributions. -- Interested in more? Watch the full Day 1 Keynote: • Ray Summit 2024 Keynote Day 1 | Where Buil... Watch the full Day 2 Keynote • Ray Summit 2024 Keynote Day 2 | Where Buil... -- 🔗 Connect with us: Subscribe to our YouTube channel: / @anyscale Twitter: https://x.com/anyscalecompute LinkedIn: / joinanyscale Website: https://www.anyscale.com

Transforming Multimodal Data Management with LanceDB-Ray | Ray Summit 2024

Everything I Learned Training Frontier Small Models — Maxime Labonne, Liquid AI

Distributed Model Training with Ray at Capital One | Ray Summit 2025

Meta's Roadmap for Full Stack AI: Insights from Joe Spisak | Ray Summit 2024

Optimizing Large-Scale RL with SGLang | Chenyang Zhao | AER Labs

NVIDIA Made Everyone Mad with the RTX Spark

Nothing has changed about software engineering | Ben Eggers | Bug Bash 2026

RAG vs. CAG: Solving Knowledge Gaps in AI Models

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

How Coinbase Uses Ray, vLLM & LiteLLM to Power Secure LLM Services | Ray Summit 2025

Webinar: Getting Started with Distributed Training at Scale

Why Ray Became a Distributed Computing Engine for Modern AI

LIVE: Conan O’Brien speaks at Harvard graduation ceremony (full)

Andrej Karpathy: Software Is Changing (Again)

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Yann LeCun's $1B Bet Against LLMs

AI Agents for Beginners – Part 1 (Free Labs)

Multimodal data: Architecting pipelines that don’t break at scale

MCP vs ADK: How Modern AI Agents Connect and Work Together

