Build Bigger With Small Ai: Running Small Models Locally
It's finally possible to bring the awesome power of Large Language Models (LLMs) to your laptop. This talk will explore how to run and leverage small, openly available LLMs to power common tasks involving data, including selecting the right models, practical use cases for running small models, and best practices for deploying small models effectively alongside databases. Bio: Jeffrey Morgan is the founder of Ollama, an open-source tool to get up and run large language models. Prior to founding Ollama, Jeffrey founded Kitematic, which was acquired by Docker and evolved into Docker Desktop. He has previously worked at companies including Docker, Twitter, and Google. ➡️ Follow Us LinkedIn: / small-data-sf X/Twitter : / smalldatasf Website: https://www.smalldatasf.com/ Discover how to run large language models (LLMs) locally using Ollama, the easiest way to get started with small AI models on your Mac, Windows, or Linux machine. Unlike massive cloud-based systems, small open source models are only a few gigabytes, allowing them to run incredibly fast on consumer hardware without network latency. This video explains why these local LLMs are not just scaled-down versions of larger models but powerful tools for developers, offering significant advantages in speed, data privacy, and cost-effectiveness by eliminating hidden cloud provider fees and risks. Learn the most common use case for small models: combining them with your existing factual data to prevent hallucinations. We dive into retrieval augmented generation (RAG), a powerful technique where you augment a model's prompt with information from a local data source. See a practical demo of how to build a vector store from simple text files and connect it to a model like Gemma 2B, enabling you to query your own data using natural language for fast, accurate, and context-aware responses. Explore the next frontier of local AI with small agents and tool calling, a new feature that empowers models to interact with external tools. This guide demonstrates how an LLM can autonomously decide to query a DuckDB database, write the correct SQL, and use the retrieved data to answer your questions. This advanced tutorial shows you how to connect small models directly to your data engineering workflows, moving beyond simple chat to create intelligent, data-driven applications. Get started with practical applications for small models today, from building internal help desks to streamlining engineering tasks like code review. This video highlights how small and large models can work together effectively and shows that open source models are rapidly catching up to their cloud-scale counterparts. It's never been a better time for developers and data analysts to harness the power of local AI. Watch with full transcript & resources: https://motherduck.com/videos/build-b...

An Evolving DAG for the LLM world - Julia Schottenstein of LangChain at Small Data SF

RAG vs. CAG: Solving Knowledge Gaps in AI Models

Yann LeCun's $1B Bet Against LLMs

The DuckLake Lakehouse: From Getting Started to Going Fast

Want to Run AI Agents Locally? Here is The Bare Minimum Setup/Build

The Best Local Agentic Coding Workflow (Complete Guide)

GraphRAG: The Marriage of Knowledge Graphs and RAG: Emil Eifrem

Running LLMs Locally Just Got Way Better - Ollama + MCP

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

What Can a 500MB LLM Actually Do? You'll Be Surprised!

Why Google Just Gave Away Gemma 4 for Free

Small vs. Large AI Models: Trade-offs & Use Cases Explained

Don't learn AI Agents without Learning these Fundamentals

Attacking AI - Jason Haddix - NDC Security 2026

Feed Your OWN Documents to a Local Large Language Model!

Master Gemma 4 in 20 Minutes

Why AI Agents are either the best or worst thing we’ve ever built

Full Walkthrough: Workflow for AI Coding — Matt Pocock

Small language models with Google AI Edge

