
▶︎
Apollo Research: Q & A on 'Frontier Models are Capable of In-Context Scheming', Alex & Marius Q&A.

▶︎
Reflection AI’s Misha Laskin on the AlphaGo Moment for LLMs | Training Data

▶︎
Overview of Version 15: Useful AI and New Core Functionality

▶︎
Interpretability via Symbolic Distillation

▶︎
The FASTEST introduction to Reinforcement Learning on the internet

▶︎
Causal AI for real-world public health decisions

▶︎
Jacob Andreas | What Learning Algorithm is In-Context Learning?

▶︎
Reinforcement Learning Series: Overview of Methods

▶︎
Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

▶︎
Sergey Levine, Data-Driven RL in Robotics, Language, and Beyond Share, 15.Feb.2023

▶︎
Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

▶︎
Peter Stone - Practical Reinforcement Learning: Lessons from 30 Years of Research - RLC 2024

▶︎
UNBOXING THE FUTURE OF HEALTHCARE (SOTA OF AI) DAY 2

▶︎
Transformers As Statisticians: Provable In-Context Learning With In-Context Algorithm Selection
![[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://i.ytimg.com/vi/bAWV_yrqx4w/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDokrriuR2L23xh1Ef15w23TimFRw)
▶︎
[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

▶︎
RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

▶︎
What Is In-Context Learning in Deep Learning?

▶︎
Training Sand to Think: Artificial General Intelligence & Future of Physics

▶︎
Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

▶︎
