TEST TIME Optimized AI REASONING (MIT)

Optimized Test-Time Training by @mit: Shaping AI's Future in Reasoning. This video presents a novel approach to improving the reasoning capabilities of large language models (LLMs): Test-Time Training (TTT) with a Leave-One-Out (LOO) strategy, applied to the Abstraction and Reasoning Corpus (ARC). ARC tasks require abstract pattern recognition and rule inference, often from only a few input-output examples. TTT addresses this by fine-tuning lightweight Low-Rank Adapters (LoRA) at inference time.

The method breaks each task into independent subtasks. With LOO, one of the task's demonstration input-output pairs is held out as a pseudo-test example, and the model is fine-tuned on the remaining pairs plus augmented data. This adapts the model to the specific logic of each task, so the LLM generalizes abstract transformations better while avoiding information leakage from the held-out pair. The augmentation step enriches the limited examples with transformations such as flips, rotations, and rule-based variations, making the task-specific adaptation more robust.

This dynamic TTT process contrasts with static pre-training and with in-context learning because it updates model parameters during inference. In-context learning feeds the examples directly as input without any parameter updates; TTT instead uses the auxiliary dataset to fine-tune a LoRA adapter for each subtask independently. This lets the model handle ARC's particular challenges, such as generalizing from minimal data and adapting to task-specific reasoning rules. With a reported state-of-the-art accuracy of 53% on the ARC public validation set, the approach shows significant improvements over baseline methods and offers a scalable framework for abstract reasoning tasks, especially in few-shot scenarios.
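The leave-one-out dataset creation described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the authors' implementation: grids are represented as lists of lists, the function names (`augmentations`, `leave_one_out_tasks`, `build_ttt_dataset`) are hypothetical, only flips and a rotation are included, and the LoRA fine-tuning step itself is left out.

```python
# Hypothetical sketch of test-time dataset creation for one ARC-style task.
# Grids are lists of lists of ints; all names here are illustrative.

def augmentations():
    """Return a few named geometric grid transforms (an assumed subset)."""
    return {
        "identity": lambda g: [row[:] for row in g],
        "flip_lr": lambda g: [row[::-1] for row in g],       # mirror left-right
        "flip_ud": lambda g: [row[:] for row in g[::-1]],    # mirror up-down
        "rot90": lambda g: [list(r) for r in zip(*g[::-1])], # rotate clockwise
    }

def leave_one_out_tasks(pairs):
    """From K demonstration pairs, build K subtasks.

    Each subtask holds out one pair as the target and keeps the other
    K-1 pairs as context, so the target never appears in its own context.
    """
    tasks = []
    for i, held_out in enumerate(pairs):
        context = pairs[:i] + pairs[i + 1:]
        tasks.append({"context": context, "target": held_out})
    return tasks

def build_ttt_dataset(pairs):
    """Apply each transform to inputs and outputs alike, then run leave-one-out."""
    dataset = []
    for name, transform in augmentations().items():
        transformed = [(transform(x), transform(y)) for x, y in pairs]
        for task in leave_one_out_tasks(transformed):
            task["augmentation"] = name
            dataset.append(task)
    return dataset
```

With three demonstration pairs and the four transforms above, `build_ttt_dataset` yields twelve subtasks. In a TTT setup, each subtask would then be serialized into a prompt and used as a training example for the task's adapter. Applying the same transform to both input and output is an assumption that fits geometric rules; it may not preserve every ARC rule.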
All rights w/ authors:
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
https://arxiv.org/pdf/2411.07279v1

00:00 Optimization of Test Time Training
01:08 ARC Intelligence test for AI
02:37 3 Insights into TTT
05:17 Test Time Dataset Creation
08:05 This is not ICL
09:47 Pre-train - Finetune - LoRA Adapter
13:00 ARC Dataset Characteristics
15:54 9000% Human AI
17:47 Leave One Out training
20:17 Cheating?
22:37 Limitations on TTT*
25:21 AI Agents and Security
26:30 Combine w Reward Policy MCTS

#reasoning #ai #massachusettsinstituteoftechnology #training #aieducation #robot
