CMU Advanced NLP Spring 2025 (20): Advanced Post-Training

This lecture (by Sean Welleck) for CMU CS 11-711, Advanced NLP covers: Supervised Fine-tuning Reward Modeling Reinforcement Learning Direct Preference Optimization

CMU Advanced NLP Spring 2025 (21): Multimodal Modeling I

CMU Advanced NLP Spring 2025 (21): Multimodal Modeling I

CMU Advanced NLP Spring 2025 (11): Reinforcement Learning

CMU Advanced NLP Spring 2025 (11): Reinforcement Learning

Nirupam Gupta - Robust Distributed Learning : A Quest to Learning in Untrusted Environments - 2

Nirupam Gupta - Robust Distributed Learning : A Quest to Learning in Untrusted Environments - 2

CMU Advanced NLP Spring 2025 (16): Parallelism and Scaling

CMU Advanced NLP Spring 2025 (16): Parallelism and Scaling

Blockchain Educational Video - Smart Contract Security

Blockchain Educational Video - Smart Contract Security

Training Sand to Think: Artificial General Intelligence & Future of Physics

Training Sand to Think: Artificial General Intelligence & Future of Physics

RLHF, PPO & GRPO Explained: A Top-Down Guide to LLM Policy Optimization

RLHF, PPO & GRPO Explained: A Top-Down Guide to LLM Policy Optimization

CMU Advanced NLP Fall 2024 (3): Language and Sequence Modeling

CMU Advanced NLP Fall 2024 (3): Language and Sequence Modeling

CMU Advanced NLP Spring 2026 (1): Introduction & Fundamentals

CMU Advanced NLP Spring 2026 (1): Introduction & Fundamentals

Rotary Positional Embeddings: Combining Absolute and Relative

Rotary Positional Embeddings: Combining Absolute and Relative

CMU Advanced NLP Fall 2024 (6): Instruction Tuning

CMU Advanced NLP Fall 2024 (6): Instruction Tuning

CMU Advanced NLP Spring 2025 (19): Efficient Inference

CMU Advanced NLP Spring 2025 (19): Efficient Inference

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

CMU Advanced NLP Fall 2024 (23): MagicPIG & Factor - Methods for Long Context LMs

CMU Advanced NLP Fall 2024 (23): MagicPIG & Factor - Methods for Long Context LMs

CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback

CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback

CMU Advanced NLP Spring 2025 (1): Introduction to NLP

CMU Advanced NLP Spring 2025 (1): Introduction to NLP

Bridging Associative Memory and Probabilistic Modeling

Bridging Associative Memory and Probabilistic Modeling

CMU LLM Inference (1): Introduction to Language Models and Inference

CMU LLM Inference (1): Introduction to Language Models and Inference