Low Level Technicals of LLMs: Daniel Han

This workshop will be split into three one-hour blocks:

1. How to analyze & fix LLMs - how to find and fix bugs in Gemma, Phi-3, Llama & tokenizers
2. Finetuning with Unsloth - continued pretraining, reward modelling, QLoRA & more
3. Deep dive into LLM technicals - hand deriving derivatives, SOTA finetuning tricks

It's recommended you have Python with PyTorch and Unsloth installed (or use online Google Colab / Kaggle). College level maths and programming would be helpful.

Recorded live in San Francisco at the AI Engineer World's Fair. See the full schedule of talks at https://www.ai.engineer/worldsfair/20... & join us at the AI Engineer World's Fair in 2025! Get your tickets today at https://ai.engineer/2025

About Daniel

Hey, I'm Daniel, the algos guy behind Unsloth. I love making LLM training go fast! We're the guys who fixed 8 of Google's Gemma bugs and a 2048 SWA Phi-3 issue, found tokenization issues and fixed untrained tokens with Llama-3, and I run Unsloth with my brother Michael! Our open source package makes finetuning of LLMs 2x faster and uses 70% less VRAM with no accuracy degradation. I used to work at NVIDIA making GPU algos go fast and helped NASA engineers process data from a Mars rover faster!
