Building Multimodal AI Applications Using MongoDB & Voyage AI

Code Along with us in DataLab! https://bit.ly/4o1i5wJ Resources (including link to notebook + slides): https://bit.ly/3IYdeML Session Pre-requisites - MongoDB cluster setup: Register for a free MongoDB Atlas account https://www.mongodb.com/cloud/atlas/r... Create a new database cluster https://www.mongodb.com/docs/guides/a... Obtain the connection string for your database cluster https://www.mongodb.com/docs/guides/a... Obtain a Voyage AI API key: Follow the steps here to get a Voyage AI API key. https://docs.voyageai.com/docs/api-ke... Obtain a Gemini API key: Follow the steps here to get a Gemini API key via Google AI Studio. https://ai.google.dev/gemini-api/docs... As AI applications expand beyond text, the ability to work with image, video, and other modalities is becoming a must-have skill. Building effective multimodal systems requires not only the right models, but also the right infrastructure to store, retrieve, and serve diverse data types at scale. In this code-along, Apoorva Joshi, a Senior AI Developer Advocate at MongoDB, will teach you how to build a simple multimodal AI application using MongoDB and Voyage AI. You’ll learn how to structure and query image and video data, apply retrieval techniques with Voyage AI, and connect everything in a functional pipeline. This session is ideal for data scientists and AI engineers looking to expand their application-building toolkit.