Benchmark Qwen 3.6 27b & 35b - 5060 ti 16GB

Software: Lmstudio Hardware: GPU: 5060ti CPU: AMD Ryzen 5 7600 Storage: 1TB NVMe SSD (Inland) Motherboard: ASUS TUF Gaming B650-E WIFI RAM: 32GB DDR5 (2x16GB) 6000MHz C32 PSU: Mars Gaming 1000W 80+ Gold Cooler: ARCTIC Freezer 36 A-RGB Case: Phanteks XT Pro Ultra (White) asahi.w Optimization: : Qwen3.5 35b a3b and Qwen3.6 35b a3b (being an MoE model) has a huge speed advantage. Your gpu offload need to be set to Full. (40) Instead, change your "Number of Layers to force the Expert Into CPU" setting. Set this mostly to the right until it fills them to system ram instead. Then simply slide them back down to the left by 4 each time until you hit the idle vram use. We will be testing two very different architectures: the dense 27B and the massive 35B MoE (A3B). With only 16GB of VRAM, we’ll see how many layers we can offload to the GPU and if the tokens per second (tok/s) are enough for a real-world workflow. 📊 Models tested in this video: Qwen 3.6 27B (Dense): A powerhouse for logic and complex instructions. How much can we squeeze into 16GB? Qwen 3.6 35B-A3B (MoE): The Mixture of Experts version. It has more parameters, but does it run faster than the 27B? ⏱️ Timestamps: 0:00 Big thanks 0:10 Qwen 3.6 MoE No optimization 1:18 Results 1:28 Qwen 3.6 MoE 35b - optimized 2:26 Amazing Results 2:38 Qwen 3.6 27B 5060ti 3:52 Results 3 4:02 Next video Subscribe for Ep. 4 where we take these Qwen models to the RTX 3060 12GB! #RTX5060Ti #Qwen36 #Qwen3.6 #AI #LocalLLM #NVIDIA #Jordutech #GPUBenchmark #TechReview #OpenSourceAI #LMStudio #AlibabaCloud #ArtificialIntelligence #MachineLearning #PCGaming #VRAM #Quantization #MoE #DeepLearning #RTX3060 #SmartTech

Qwen 3.6 27B is a MONSTER, but can it run locally? I tested it on an RTX 5090 and RTX 5060 Ti and...

Qwen 3.6 27B is a MONSTER, but can it run locally? I tested it on an RTX 5090 and RTX 5060 Ti and...

Build Powerful Local Coding Agent on Budget GPU with Llama.cpp and Pi

Build Powerful Local Coding Agent on Budget GPU with Llama.cpp and Pi

Make Windows 11 200% Faster With This ONE Setting (Its Insane)

Make Windows 11 200% Faster With This ONE Setting (Its Insane)

Qwen 3.6 27B vs 35B-A3B: 16GB VRAM Local Test

Qwen 3.6 27B vs 35B-A3B: 16GB VRAM Local Test

Should You Buy nVidia RTX 5060ti 16gb GPU for Local AI? Qwen 3.6 Agents?

Should You Buy nVidia RTX 5060ti 16gb GPU for Local AI? Qwen 3.6 Agents?

Leave Windows 11 Idle for 24 Hours and Watch What Happens

Leave Windows 11 Idle for 24 Hours and Watch What Happens

Not even close… 16GB Rx 9060 XT vs RTX 5060 Ti

Not even close… 16GB Rx 9060 XT vs RTX 5060 Ti

Wildflower Meadow | Turn Your TV Into Art | Vintage Oil Painting Screensaver | TV Framed Art | 3 Hrs

Wildflower Meadow | Turn Your TV Into Art | Vintage Oil Painting Screensaver | TV Framed Art | 3 Hrs

QWEN3.5 Local LLM test | RTX 50 series 16GB VRAM | Fast, small and good local LLM is it possible?

QWEN3.5 Local LLM test | RTX 50 series 16GB VRAM | Fast, small and good local LLM is it possible?

Meta’s AI Clusterf*ck Is Humiliating Zuckerberg

Meta’s AI Clusterf*ck Is Humiliating Zuckerberg

Welcome Home Sign / 4K / Wallpaper / Screensaver / TV Art / Frame Art

Welcome Home Sign / 4K / Wallpaper / Screensaver / TV Art / Frame Art

Android 17 sucks. So I put Linux on a phone.

Android 17 sucks. So I put Linux on a phone.

Running a 35B AI Model on 6GB VRAM, FAST (llama.cpp Guide)

Running a 35B AI Model on 6GB VRAM, FAST (llama.cpp Guide)

Testing Qwen 3.6 35B vs 27b Locally on a Project

Testing Qwen 3.6 35B vs 27b Locally on a Project

I tested PewDiePie's AI platform...

I tested PewDiePie's AI platform...

This Ridiculous $200 AI GPU Shouldn’t Be This Good

This Ridiculous $200 AI GPU Shouldn’t Be This Good

I ran 80B model on 16GB GPU - It's surprisingly good! (Qwen 3 Coder Next Review)

I ran 80B model on 16GB GPU - It's surprisingly good! (Qwen 3 Coder Next Review)

Wild Flowers in Enchanted Forest - Vintage Oil Painting | 1 Hour 4K Framed Art Screensaver for TV

Wild Flowers in Enchanted Forest - Vintage Oil Painting | 1 Hour 4K Framed Art Screensaver for TV

Qwen 3.6 27b Local Ai Review and Benchmark

Qwen 3.6 27b Local Ai Review and Benchmark

Qwen 3.6 on RTX 5090: Maxing out the 27B and 35B MoE models

Qwen 3.6 on RTX 5090: Maxing out the 27B and 35B MoE models