Building Multimodal AI Agents From Scratch — Apoorva Joshi, MongoDB
In this hands-on workshop, you will build a multimodal AI agent capable of processing mixed-media content—from analyzing charts and diagrams to extracting insights from documents with embedded visuals. Using MongoDB as a vector database and memory store, and Google's Gemini for multimodal reasoning, you will gain hands-on experience with multimodal data processing pipelines and agent orchestration patterns by implementing core components directly, using good ol' Python. --- In this hands-on workshop, you will build a multimodal AI agent capable of processing mixed-media content—from analyzing charts and diagrams to extracting insights from documents with embedded visuals. Using MongoDB as a vector database and memory store, and Google's Gemini for multimodal reasoning, you will gain hands-on experience with multimodal data processing pipelines and agent orchestration patterns by implementing core components directly, using good ol' Python. You will be provided with a GitHub repository consisting of learning materials and resources required to successfully execute the hands-on portions of the workshop. --related links-- / apoorvajoshi95

The Agent Cloud: Databricks’ Bet on the Future of AI — Matei Zaharia and Reynold Xin

Architecting Agent Memory: Principles, Patterns, and Best Practices — Richmond Alake, MongoDB

demo

AI Agents for Beginners – Part 1 (Free Labs)

Anthropic Workshop: Build Agents That Run for Hours — Ash Prabaker & Andrew Wilson

CLAUDE CODE ADVANCED FULL COURSE (3 HOURS)

Building Multimodal AI Applications Using MongoDB & Voyage AI

Full Walkthrough: Workflow for AI Coding — Matt Pocock

Don't learn AI Agents without Learning these Fundamentals

Building AI Agents that actually work (Full Course)

Skill Issue: Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer

Vertical AI Agents Could Be 10X Bigger Than SaaS

Tips for building AI agents

Claude Architect: Multi-Agent Orchestration

Billionaire's WARNING: I'm SELLING. The Crash Is Already Here!

How AI agents & Claude skills work (Clearly Explained)

AI Agents in 38 Minutes - Complete Course from Beginner to Pro

Build a Full-Stack GenAI Project in 4 Hours (FastAPI, React, Supabase)

