Ogunyale & Olu-Ipinlaye - Teach an LLM to Navigate the Web | Pydata London 26

Oreolorun Olu-Ipinlaye & Richard Kehinde Ogunyale - Building a Browser Agent from Scratch: Teach an LLM to Navigate the Web AI systems that can autonomously navigate websites, fill forms, extract data, and complete multi-step workflows; are one of the most exciting and practical applications of large language models in 2026. Libraries like browser-use (60k+ GitHub stars) and Skyvern have demonstrated their potential, but their abstractions can obscure the surprisingly approachable fundamentals underneath. In this 90-minute hands-on tutorial, attendees will build a browser agent entirely from scratch using only Python, Playwright, and an LLM API. No agent frameworks, no magic; just the core building blocks: extracting and structuring the DOM into an LLM-friendly representation, capturing screenshots for vision-based reasoning, building the observe-think-act agent loop, and handling real-world challenges like dynamic content, multi-tab navigation, and error recovery. By building from first principles, attendees will gain a deep understanding of how browser agents actually work; knowledge that transfers directly to using, debugging, and extending any browser agent framework. Every participant will leave with a working agent that can autonomously complete tasks on live websites. This tutorial is aimed at Python developers and data scientists who are curious about AI-driven browser automation. Basic Python proficiency and familiarity with async/await are expected. No prior experience with Playwright, browser automation, or agent frameworks is required. The web is the world’s largest API, but it was designed for humans, not machines. Traditional browser automation tools like Selenium and Playwright require developers to write brittle scripts with hardcoded selectors that break whenever a website changes its layout. Browser agents flip this model: instead of telling the browser exactly what to click, you describe what you want to accomplish, and an LLM figures out how to do it; reading the page like a human would, reasoning about what to do next, and adapting when things don’t go as expected. This approach has seen explosive growth. The open-source browser-use library surpassed 60,000 GitHub stars within months of release, and its creators raised $17M in seed funding. Skyvern, Browserbase, and others have built commercial platforms around the same idea. Under the hood, these tools all share a remarkably similar architecture: a perception layer that converts web pages into LLM-readable context, a reasoning layer where the LLM decides what action to take, and an execution layer that carries out the action via browser automation. This tutorial strips away the abstraction layers and builds each component from scratch. The “from scratch” approach is deliberate: by understanding how the DOM is parsed, how screenshots are fed to vision models, and how the agent loop manages state, attendees gain transferable knowledge that applies to any browser agent tool or framework. When something breaks in production (and it will), this understanding is what separates debugging from guessing. www.pydata.org PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome! 00:10 Help us add time stamps or captions to this video! See the description for details. Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVi...

Keynote-Martin O'Reilly - LLMs and AI agents demystified | Pydata London 26

Keynote-Martin O'Reilly - LLMs and AI agents demystified | Pydata London 26

Is RAG Still Needed? Choosing the Best Approach for LLMs

Is RAG Still Needed? Choosing the Best Approach for LLMs

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

How AI agents & Claude skills work (Clearly Explained)

How AI agents & Claude skills work (Clearly Explained)

Prattyush Mangal - Production-Ready AI Agents: From LLMs to Small Language Models | Pydata London 26

Prattyush Mangal - Production-Ready AI Agents: From LLMs to Small Language Models | Pydata London 26

We let AI buy a robot and a car, it does exactly what experts warned.

We let AI buy a robot and a car, it does exactly what experts warned.

China’s Secret | The Most Unbelievable Megaprojects in China | 4K Travel Documentary

China’s Secret | The Most Unbelievable Megaprojects in China | 4K Travel Documentary

Don't learn AI Agents without Learning these Fundamentals

Don't learn AI Agents without Learning these Fundamentals

Crintea - Making Databases LLM-Ready Building Semantic Layers with Semantido | Pydata London 26

Crintea - Making Databases LLM-Ready Building Semantic Layers with Semantido | Pydata London 26

How To Think SO CLEARLY People Assume You're A Genius

How To Think SO CLEARLY People Assume You're A Genius

Chris Fonnesbeck - Flexible Statistical Modeling | Pydata London 26

Chris Fonnesbeck - Flexible Statistical Modeling | Pydata London 26

Hitendri Bomble-The Silent Crash:Why Your RAG Evaluation Metrics Are Lying to You | PyData London 26

Hitendri Bomble-The Silent Crash:Why Your RAG Evaluation Metrics Are Lying to You | PyData London 26

Sujee Maniyam- Using coding agents with open models | Pydata London 26

Sujee Maniyam- Using coding agents with open models | Pydata London 26

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

Microsoft Fabric and Power BI - Developer of the Future⚡ [Full Course]

Microsoft Fabric and Power BI - Developer of the Future⚡ [Full Course]

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

Transformers, the tech behind LLMs | Deep Learning Chapter 5

Transformers, the tech behind LLMs | Deep Learning Chapter 5

God Says:"I JUST CONFIRMED — ONLY YOU CAN SEE THIS LETTER"/God Message Now/God Message

God Says:"I JUST CONFIRMED — ONLY YOU CAN SEE THIS LETTER"/God Message Now/God Message

This is not the AI we were promised | The Royal Society

This is not the AI we were promised | The Royal Society

What the Armor of God Really Means When You Feel Too Weak to Fight (No Ads)

What the Armor of God Really Means When You Feel Too Weak to Fight (No Ads)