Can a Small Local AI Model Triage Real Email? Python + Ollama Agent Test

Can a small local AI model do real executive assistant work on a regular laptop? In this video, I continue my local AI agent experiment by building a communication triage pipeline. The goal is to see whether a small model can sort email-shaped data into useful categories like needs reply, needs action, waiting on, follow up, and FYI — then turn that into a readable briefing and action handoff files. This is not a polished production framework or a full Python coding tutorial. It is an experiment build. I’m testing what small local models can actually do, where they fail, what settings and guardrails help, and how the agent workflow improves over time. In this episode: Build the email triage pipeline with test data Move from Markdown output to strict JSON output Tune num_predict and num_ctx Add batching for larger message sets Test whether threading helps Create output paths for a briefing and action handoff files Swap in real Gmail data Clean and filter noisy real email before sending it to the model Compare Qwen, Llama, Gemma, and Gemini on the same workflow The series hypothesis: A small local AI model can handle useful work tasks on a regular laptop when given clear inputs, tight limits, and controlled access to data. Local models tested: Qwen3:4B Qwen3:1.7B Llama3.2:3B Gemma3:4B Cloud benchmark: Gemini 2.5 Flash The big takeaway: small local models can do real work, but the surrounding system matters. The connector, cleanup, prompt, batching, schema, and output files are what make the workflow usable. This series is for people interested in local AI, practical AI agents, Python workflows, Ollama, privacy-focused AI, and realistic experiments with small models. 0:00 Intro: Can a Small Model Triage Email? 2:08 Part 1: Building the Triage Pipeline 4:09 Prompt Structure + Output Categories 6:11 Tuning the Local Model 8:33 Adding More Data + Batching 10:02 Does Threading Help? 11:05 Briefing + Action Handoffs 13:23 Adjusting Prompt Rules 14:06 Part 2: Testing Real Gmail Data 15:40 Part 3: Local vs Cloud Model Comparison

The Local AI Hardware Mistake Everyone Makes

The Local AI Hardware Mistake Everyone Makes

This Tool Forces AI To Write Good Code

This Tool Forces AI To Write Good Code

In Person: Making AI faster and safer with Docker by Michael Irwin

In Person: Making AI faster and safer with Docker by Michael Irwin

Understand-Anything vs Graphify: I Tested Both on My SaaS

Understand-Anything vs Graphify: I Tested Both on My SaaS

I Tested Every Claude Code Feature, These 12 Are the Best

I Tested Every Claude Code Feature, These 12 Are the Best

Why Did Huawei Build Its Own Programming Language? | Prof. Dan Ghica at OCX 2026

Why Did Huawei Build Its Own Programming Language? | Prof. Dan Ghica at OCX 2026

Running LLMs Locally Just Got Way Better - Ollama + MCP

Running LLMs Locally Just Got Way Better - Ollama + MCP

Canada Just Sent A FATAL Warning To The World - Brace For Impact

Canada Just Sent A FATAL Warning To The World - Brace For Impact

I Re-Created A Quant Trading Strategy With Claude Code (Insanely Cool)

I Re-Created A Quant Trading Strategy With Claude Code (Insanely Cool)

The Engineering Behind Training a 2 Trillion Parameter LLM

The Engineering Behind Training a 2 Trillion Parameter LLM

How To Build A Self-Improving AI Trading Agent (Insanely Cool)

How To Build A Self-Improving AI Trading Agent (Insanely Cool)

🧹Watch me CLEAN DATA in Minutes with Python (+10 Tips for Complex Datasets)

🧹Watch me CLEAN DATA in Minutes with Python (+10 Tips for Complex Datasets)

Harnesses in AI: A Deep Dive — Tejas Kumar, IBM

Harnesses in AI: A Deep Dive — Tejas Kumar, IBM

They Lied to You About AI (This Study Proves It)

They Lied to You About AI (This Study Proves It)

My Golden Retriever Heals a Terrified Rescue Kitten in Just 3 Meetings!

My Golden Retriever Heals a Terrified Rescue Kitten in Just 3 Meetings!

This is probably the biggest misconception about Python

This is probably the biggest misconception about Python

Poison Your Data. Fight Back Against AI.

Poison Your Data. Fight Back Against AI.

Nvidia: Not for humans anymore?

Nvidia: Not for humans anymore?

The Best Local Agentic Coding Workflow (Complete Guide)

The Best Local Agentic Coding Workflow (Complete Guide)

You're Automating The Wrong Layer (How 30,000 People Build AI Without Frameworks)

You're Automating The Wrong Layer (How 30,000 People Build AI Without Frameworks)