VibeThinker 3B - Taking on Giant Models

In this video, I look at VibeThinker 3B and how it beats some models 300x its size on certain benchmarks by improving its reasoning and chain of thought for specific use cases. While the model is not meant for production, it shows what can be done with these techniques.

Thanks to Dell for sponsoring the compute. #DellProPrecision #DellProMax

Paper: https://arxiv.org/abs/2606.16140
Weights: https://huggingface.co/WeiboAI/VibeTh...
Github: https://github.com/WeiboAI/VibeThinker
Twitter: https://x.com/Sam_Witteveen

🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes

👨‍💻Github: https://github.com/samwit/llm-tutorials

⏱️Time Stamps:
00:00 Intro
01:16 VibeThinker-3B
03:33 Benchmarks
05:16 VibeThinker-3B Paper
05:46 Architecture
09:00 Demo