Deep Networks Are Kernel Machines (Paper Explained)

#deeplearning #kernels #neuralnetworks Full Title: Every Model Learned by Gradient Descent Is Approximately a Kernel Machine Deep Neural Networks are often said to discover useful representations of the data. However, this paper challenges this prevailing view and suggest that rather than representing the data, deep neural networks store superpositions of the training data in their weights and act as kernel machines at inference time. This is a theoretical paper with a main theorem and an understandable proof and the result leads to many interesting implications for the field. OUTLINE: 0:00 - Intro & Outline 4:50 - What is a Kernel Machine? 10:25 - Kernel Machines vs Gradient Descent 12:40 - Tangent Kernels 22:45 - Path Kernels 25:00 - Main Theorem 28:50 - Proof of the Main Theorem 39:10 - Implications & My Comments Paper: https://arxiv.org/abs/2012.00152 Street Talk about Kernels: • Kernels! ERRATA: I simplify a bit too much when I pit kernel methods against gradient descent. Of course, you can even learn kernel machines using GD, they're not mutually exclusive. And it's also not true that you "don't need a model" in kernel machines, as it usually still contains learned parameters. Abstract: Deep learning's successes are often attributed to its ability to automatically discover new representations of the data, rather than relying on handcrafted features like other learning methods. We show, however, that deep networks learned by the standard gradient descent algorithm are in fact mathematically approximately equivalent to kernel machines, a learning method that simply memorizes the data and uses it directly for prediction via a similarity function (the kernel). This greatly enhances the interpretability of deep network weights, by elucidating that they are effectively a superposition of the training examples. The network architecture incorporates knowledge of the target function into the kernel. This improved understanding should lead to better learning algorithms. Authors: Pedro Domingos Links: TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick YouTube: / yannickilcher Twitter: / ykilcher Discord: / discord BitChute: https://www.bitchute.com/channel/yann... Minds: https://www.minds.com/ykilcher Parler: https://parler.com/profile/YannicKilcher LinkedIn: / yannic-kilcher-488534136 BiliBili: https://space.bilibili.com/1824646584 If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this): SubscribeStar: https://www.subscribestar.com/yannick... Patreon: / yannickilcher Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2 Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n

Yann LeCun - Self-Supervised Learning: The Dark Matter of Intelligence (FAIR Blog Post Explained)

Yann LeCun - Self-Supervised Learning: The Dark Matter of Intelligence (FAIR Blog Post Explained)

MIT 6.S191 (2021): Introduction to Deep Learning

MIT 6.S191 (2021): Introduction to Deep Learning

AlphaFold - The Most Useful Thing AI Has Ever Done

AlphaFold - The Most Useful Thing AI Has Ever Done

Math's Fundamental Flaw

Math's Fundamental Flaw

The Most Misunderstood Concept in Physics

The Most Misunderstood Concept in Physics

AI Is Creating A Rare Opportunity For Investors. How Jim Roppel Is Playing It. | Investing With IBD

AI Is Creating A Rare Opportunity For Investors. How Jim Roppel Is Playing It. | Investing With IBD

Nvidia CEO Jensen Huang Interview| Bloomberg Technology Special

Nvidia CEO Jensen Huang Interview| Bloomberg Technology Special

Something is jamming GPS over Europe. Here's what we found

Something is jamming GPS over Europe. Here's what we found

MIT 6.S191 (2020): Introduction to Deep Learning

MIT 6.S191 (2020): Introduction to Deep Learning

Yann LeCun's $1B Bet Against LLMs [Part 1]

Yann LeCun's $1B Bet Against LLMs [Part 1]

The Strange Math That Predicts (Almost) Anything

The Strange Math That Predicts (Almost) Anything

Feedback Transformers: Addressing Some Limitations of Transformers with Feedback Memory (Explained)

Feedback Transformers: Addressing Some Limitations of Transformers with Feedback Memory (Explained)

But how do AI images and videos actually work? | Guest video by Welch Labs

But how do AI images and videos actually work? | Guest video by Welch Labs

Gradient descent, how neural networks learn | Deep Learning Chapter 2

Gradient descent, how neural networks learn | Deep Learning Chapter 2

But what is a neural network? | Deep learning chapter 1

But what is a neural network? | Deep learning chapter 1

WAITING UPON THE LORD | HOW TO BE RENEWED & REFRESHED IN GOD'S PRESENCE | WAIT ON GOD

WAITING UPON THE LORD | HOW TO BE RENEWED & REFRESHED IN GOD'S PRESENCE | WAIT ON GOD

Who's Adam and What's He Optimizing? | Deep Dive into Optimizers for Machine Learning!

Who's Adam and What's He Optimizing? | Deep Dive into Optimizers for Machine Learning!

Reinforcement Learning: Machine Learning Meets Control Theory

Reinforcement Learning: Machine Learning Meets Control Theory

Pushing Simulations to the LIMIT to Find Order in Chaos

Pushing Simulations to the LIMIT to Find Order in Chaos

Transformers, the tech behind LLMs | Deep Learning Chapter 5

Transformers, the tech behind LLMs | Deep Learning Chapter 5