Andrea Montanari | Self-induced regularization from linear regression to neural networks

CMSA New Technologies in Mathematics Seminar 9/23/2020 Speaker: Andrea Montanari, Departments of Electrical Engineering and Statistics, Stanford Title: Self-induced regularization from linear regression to neural networks Abstract: Modern machine learning methods -most noticeably multi-layer neural networks- require to fit highly non-linear models comprising tens of thousands to millions of parameters. Despite this, little attention is paid to the regularization mechanism to control model's complexity. Indeed, the resulting models are often so complex as to achieve vanishing training error: they interpolate the data. Despite this, these models generalize well to unseen data : they have small test error. I will discuss several examples of this phenomenon, beginning with a simple linear regression model, and ending with two-layers neural networks in the so-called lazy regime. For these examples precise asymptotics could be determined mathematically, using tools from random matrix theory. I will try to extract a unifying picture. A common feature is the fact that a complex unregularized nonlinear model becomes essentially equivalent to a simpler model, which is however regularized in a non-trivial way. [Based on joint papers with: Behrooz Ghorbani, Song Mei, Theodor Misiakiewicz, Feng Ruan, Youngtak Sohn, Jun Yan, Yiqiao Zhong]

Pierfrancesco Urbani | Separation of timescales controls feature learning...
▶︎

Pierfrancesco Urbani | Separation of timescales controls feature learning...

Yann LeCun: World Models: Enabling the next AI revolution
▶︎

Yann LeCun: World Models: Enabling the next AI revolution

Nicolas Boffi - Flow map language models
▶︎

Nicolas Boffi - Flow map language models

Chris Fonnesbeck - Flexible Statistical Modeling | Pydata London 26
▶︎

Chris Fonnesbeck - Flexible Statistical Modeling | Pydata London 26

Regularization Part 1: Ridge (L2) Regression
▶︎

Regularization Part 1: Ridge (L2) Regression

This is not the AI we were promised | The Royal Society
▶︎

This is not the AI we were promised | The Royal Society

Python Project | Python Projects For Beginners | Python Project Tutorial | Intellipaat
▶︎

Python Project | Python Projects For Beginners | Python Project Tutorial | Intellipaat

Alastair Campbell and Jacob Rees-Mogg Debate Brexit | The Mishal Husain Show
▶︎

Alastair Campbell and Jacob Rees-Mogg Debate Brexit | The Mishal Husain Show

Türkei – USA Highlights | Gruppe D, FIFA WM 2026 | sportstudio
▶︎

Türkei – USA Highlights | Gruppe D, FIFA WM 2026 | sportstudio

Trumps Pool-Fiasko | Peinliche Videos von Politikern zur WM | extra 3 vom 25.06.2026 · Teil 2/2
▶︎

Trumps Pool-Fiasko | Peinliche Videos von Politikern zur WM | extra 3 vom 25.06.2026 · Teil 2/2

A Nobel Laureate's Honest Review of AI In Biology
▶︎

A Nobel Laureate's Honest Review of AI In Biology

How Proctor’s texts in Karen Read lawsuit could free dangerous criminals
▶︎

How Proctor’s texts in Karen Read lawsuit could free dangerous criminals

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026
▶︎

Keynote: After the AI Hype – What’s Real, and What’s Next - Richard Campbell - 2026

🫀 2025 BLS Practice Test | CPR & AED Practice Test with Detailed Answers
▶︎

🫀 2025 BLS Practice Test | CPR & AED Practice Test with Detailed Answers

Dan Freed | First Proof Introduction
▶︎

Dan Freed | First Proof Introduction

Stanford CS229: Machine Learning - Linear Regression and Gradient Descent |  Lecture 2 (Autumn 2018)
▶︎

Stanford CS229: Machine Learning - Linear Regression and Gradient Descent | Lecture 2 (Autumn 2018)

The generalization error of overparametrized models - Andrea Montanari (Stanford) MAD+ 24 Jun 2020
▶︎

The generalization error of overparametrized models - Andrea Montanari (Stanford) MAD+ 24 Jun 2020

Historian Timothy Snyder on ENDING Trump Nightmare FOR GOOD | PoliticsGirl
▶︎

Historian Timothy Snyder on ENDING Trump Nightmare FOR GOOD | PoliticsGirl

But what is quantum computing?  (Grover's Algorithm)
▶︎

But what is quantum computing? (Grover's Algorithm)

But what is a neural network? | Deep learning chapter 1
▶︎

But what is a neural network? | Deep learning chapter 1