CMU Advanced NLP Spring 2025 (20): Advanced Post-Training
This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP, covers: Supervised Fine-tuning, Reward Modeling, Reinforcement Learning, and Direct Preference Optimization.

CMU Advanced NLP Spring 2025 (21): Multimodal Modeling I

CMU Advanced NLP Spring 2025 (11): Reinforcement Learning

Nirupam Gupta - Robust Distributed Learning : A Quest to Learning in Untrusted Environments - 2

CMU Advanced NLP Spring 2025 (16): Parallelism and Scaling

Blockchain Educational Video - Smart Contract Security

Training Sand to Think: Artificial General Intelligence & Future of Physics

RLHF, PPO & GRPO Explained: A Top-Down Guide to LLM Policy Optimization

CMU Advanced NLP Fall 2024 (3): Language and Sequence Modeling

CMU Advanced NLP Spring 2026 (1): Introduction & Fundamentals

Rotary Positional Embeddings: Combining Absolute and Relative

CMU Advanced NLP Fall 2024 (6): Instruction Tuning

CMU Advanced NLP Spring 2025 (19): Efficient Inference

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

CMU Advanced NLP Fall 2024 (23): MagicPIG & Factor - Methods for Long Context LMs

CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback

CMU Advanced NLP Spring 2025 (1): Introduction to NLP

Bridging Associative Memory and Probabilistic Modeling

