ModelParallelism Tensor Parallism

peered inside the transformer and saw matrix multiplication everywhere: Y = X × W. two beautiful properties: Column split: X × [W₁ | W₂] = [X×W₁ | X×W₂] Row split: X × [W₁] = X₁×W₁ + X₂×W₂ [W₂] Applied to a transformer MLP block (Linear1 → GELU → Linear2): GPU 0: X → Linear1_colA → GELU → Linear2_rowA ─┐ ├─ All-Reduce → Output GPU 1: X → Linear1_colB → GELU → Linear2_rowB ─┘ By pairing column-parallel with row-parallel, we only needed one All-Reduce per block. For attention it was even cleaner: each GPU just owned a subset of heads. But TP has a dark side: its All-Reduces are in the critical path. You can't hide them. That's why TP only works well inside a single node with NVLink typically TP ≤ 8. A companion trick, Sequence Parallelism, splits the leftover operations (LayerNorm, Dropout) along the sequence dimension to keep activations sharded everywhere.

Germany's New Photonic XPU Just Made Nvidia & AMD GPUs Look Like Paper Weights!

Germany's New Photonic XPU Just Made Nvidia & AMD GPUs Look Like Paper Weights!

Quantum Computing Is a Lie (Here’s What I Discovered)

Quantum Computing Is a Lie (Here’s What I Discovered)

🔴 LIVE Barred Owl Nest Cam 🦉 | Post-Fledge Updates & Owl Activity

🔴 LIVE Barred Owl Nest Cam 🦉 | Post-Fledge Updates & Owl Activity

1 IN A MILLION MOMENTS IN SPORTS !

1 IN A MILLION MOMENTS IN SPORTS !

⚡The AI Memory Wall Explained: How HBM, NVIDIA, and South Korea Control the Next AI Bottleneck🧠

⚡The AI Memory Wall Explained: How HBM, NVIDIA, and South Korea Control the Next AI Bottleneck🧠

Simon Cowell Was Skeptical... Then She Left Him Speechless | BGT 2026

Simon Cowell Was Skeptical... Then She Left Him Speechless | BGT 2026

Sacha 'Borat' Baron Cohen Asks Melanie "What Her Price Is" | Friday Night With Jonathan Ross

Sacha 'Borat' Baron Cohen Asks Melanie "What Her Price Is" | Friday Night With Jonathan Ross

China's Chip Breakthrough Terrifies Taiwan and America

China's Chip Breakthrough Terrifies Taiwan and America

6 Humanoids You Can Actually Buy in 2026!

6 Humanoids You Can Actually Buy in 2026!

Unix vs Linux difference explained for Beginners

Unix vs Linux difference explained for Beginners

EEVblog 1752 - Texas Instruments SCREWED UP the NE5532!

EEVblog 1752 - Texas Instruments SCREWED UP the NE5532!

Rowan Atkinson's Brilliant Humor Leaves Celebrities in Tears!

Rowan Atkinson's Brilliant Humor Leaves Celebrities in Tears!

THIS Is What Happens When You Attack a US Aircraft Carrier

THIS Is What Happens When You Attack a US Aircraft Carrier

I Trusted Brave for 2 Years. Then I Ran Wireshark.

I Trusted Brave for 2 Years. Then I Ran Wireshark.

NVIDIA Monopoly is DEAD | OPEN-SOURCE Chips Are HERE!

NVIDIA Monopoly is DEAD | OPEN-SOURCE Chips Are HERE!

I made a GPU at home

I made a GPU at home

Planet of the Apes: The Banned Ending They Hide For 60 Years

Planet of the Apes: The Banned Ending They Hide For 60 Years

Rufus JUST DESTROYED Windows 11 As Millions Watch Microsoft COLLAPSE!

Rufus JUST DESTROYED Windows 11 As Millions Watch Microsoft COLLAPSE!

macOS Is Technically Better Than Linux. Here's Why It Doesn't Matter.

macOS Is Technically Better Than Linux. Here's Why It Doesn't Matter.

Mike Brewer Reveals The Truth About What Happened to Wheeler Dealers

Mike Brewer Reveals The Truth About What Happened to Wheeler Dealers