Blake Bordelon - Infinite limits and scaling laws of neural networks - IPAM at UCLA

Recorded 16 October 2024. Blake Bordelon of Harvard University presents "Infinite limits and scaling laws of neural networks" at IPAM's Theory and Practice of Deep Learning Workshop. Abstract: Scaling up the size and training horizon of deep learning models has enabled breakthroughs in computer vision and natural language processing. Empirical evidence suggests that these neural network models are described by regular scaling laws where performance of finite parameter models improves as model size increases, eventually approaching a limit described by the performance of an infinite parameter model. In this talk, we will first examine certain infinite parameter limits of deep neural networks which preserve representation learning and then describe how quickly finite models converge to these limits. Using dynamical mean field theory methods, we provide an asymptotic description of the learning dynamics of randomly initialized infinite width and depth networks. Next, we will empirically investigate how close the training dynamics of finite networks are to these idealized limits. Lastly, we will provide a theoretical model of neural scaling laws which describes how generalization depends on three computational resources: training time, model size and data quantity. This theory allows analysis of compute optimal scaling strategies and predicts how model size and training time should be scaled together in terms of spectral properties of the limiting kernel. The theory also predicts how representation learning can improve neural scaling laws in certain regimes. For very hard tasks, the theory predicts that representation learning can approximately double the training-time exponent compared to the static kernel limit. Learn more online at: https://www.ipam.ucla.edu/programs/wo...

Gintare Karolina Dziugaite - The dynamics of memorization and generalization in deep learning

Gintare Karolina Dziugaite - The dynamics of memorization and generalization in deep learning

Concepts of Multilevel, Longitudinal and Mixed Models 5

Concepts of Multilevel, Longitudinal and Mixed Models 5

Inside the World's Smartest Robot Brain [VLA]

Inside the World's Smartest Robot Brain [VLA]

Yann LeCun's $1B Bet Against LLMs

Yann LeCun's $1B Bet Against LLMs

Mean Field Approaches to Learning Dynamics in Deep Networks | Blake Bordelon, Harvard University

Mean Field Approaches to Learning Dynamics in Deep Networks | Blake Bordelon, Harvard University

Yann LeCun | Self-Supervised Learning, JEPA, World Models, and the future of AI

Yann LeCun | Self-Supervised Learning, JEPA, World Models, and the future of AI

Attention in transformers, step-by-step | Deep Learning Chapter 6

Attention in transformers, step-by-step | Deep Learning Chapter 6

The biggest lie about the double slit experiment

The biggest lie about the double slit experiment

The Brain’s Learning Algorithm Isn’t Backpropagation

The Brain’s Learning Algorithm Isn’t Backpropagation

But what is a neural network? | Deep learning chapter 1

But what is a neural network? | Deep learning chapter 1

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

1: Introduction to Neural Networks and Deep Learning; Training Deep NNs

1: Introduction to Neural Networks and Deep Learning; Training Deep NNs

All Machine Learning Models Clearly Explained!

All Machine Learning Models Clearly Explained!

Trump Brags About His Brain, Crowd Size & Pool, CBS Fires Scott Pelley & Don Jr's Honeymoon Video

Trump Brags About His Brain, Crowd Size & Pool, CBS Fires Scott Pelley & Don Jr's Honeymoon Video

MIT 6.S191: AI for Science

MIT 6.S191: AI for Science

Professor Jiang: World War 3 Is About To Begin, Let Me Explain!

Professor Jiang: World War 3 Is About To Begin, Let Me Explain!

Lecture 7: Explaining Neural Scaling Laws

Lecture 7: Explaining Neural Scaling Laws

Mathe-News 🚨 KI löst das Erdős-Einheitsabstand-Problem!

Mathe-News 🚨 KI löst das Erdős-Einheitsabstand-Problem!

Building the PERFECT Linux PC with Linus Torvalds

Building the PERFECT Linux PC with Linus Torvalds

AlphaFold - The Most Useful Thing AI Has Ever Done

AlphaFold - The Most Useful Thing AI Has Ever Done