How RWKV-7 "Goose" and It's Linear Inference Work with Author Eugene Cheah

Paper 📜 https://arxiv.org/abs/2503.14456 Links + Notes 📝 https://www.oxen.ai/blog/how-rwkv-7-g... Join Arxiv Dives 🤿 https://oxen.ai/community Discord 🗿   / discord   Use Oxen AI 🐂 https://oxen.ai/ Oxen AI makes versioning your datasets as easy as versioning your code! Even is millions of unstructured images, the tool quickly handles any type of data so you can build cutting-edge AI. -- Chapters 0:00 Why is RWKV-7 Goose interesting 2:53 How to quickly run RWKV-7 Goose 4:04 What is RWKV-7 10:20 RNN’s forget things 12:33 First paper: Reinventing RNNs for the Transformer Era 24:22 Paper author Eugene Cheah joins the dive 36:43 The intuition behind each model layer 47:57 Parallelization during training 53:01 How well did RWKV-7 do on benchmarks? 56:50 Live evals on RWKV-7 and fine-tuning tips 1:00:38 Why they made the World Tokenizer