VibeVoice (Speech Generation/Voice Cloning) on Framework Desktop with Strix Halo (AMD Ryzen AI Max+)

In this video, I show how to generate natural-sounding speech locally on the Framework Desktop with AMD Ryzen AI Max “Strix Halo”, including cloning a voice from a short sample and creating multi-speaker conversations. The intro you hear at the start was generated entirely by VibeVoice, cloned from my own voice.

VibeVoice is Microsoft’s open-weight model for long-form, multi-speaker speech (released late August 2025). I’ll walk you through setup on Strix Halo using a Fedora toolbox and the Gradio UI, then demo single-speaker and multi-speaker clips, plus zero-shot voice cloning. I’ll also cover stability fixes for ROCm crashes.

Timestamps:
00:00 - AI-Generated Intro (VibeVoice)
01:47 - Setup on Strix Halo (Toolbox + Gradio)
03:28 - First Demo: Single-Speaker
05:18 - Multi-Speaker Conversations
05:42 - Clone Your Own Voice (Zero-Shot)
06:23 - Stability Fixes (librosa / numba / LLVM / ROCm)
08:26 - Generating a Full Podcast
09:33 - AI-Generated Podcast: How VibeVoice Works

Links & Resources:
GitHub repo (toolboxes, scripts, stability fixes): https://github.com/kyuz0/amd-strix-ha...
Framework Desktop (Strix Halo): https://frame.work/
Strix Halo Homelab guide + Discord (by deseven): https://strixhalo-homelab.d7.wtf/
VibeVoice (project): https://github.com/microsoft/VibeVoice
VibeVoice (project page): https://microsoft.github.io/VibeVoice/
VibeVoice models (Hugging Face): https://huggingface.co/microsoft/Vibe...
Community mirror example (for large weights): https://huggingface.co/aoi-ot/VibeVoi...
Gradio (UI framework): https://github.com/gradio-app/gradio
Librosa (audio features): https://github.com/librosa/librosa
Numba (JIT; disabled in this toolbox fix): https://github.com/numba/numba
LLVM (compiler backend): https://llvm.org/
