Differentially Private Prototypes for Imbalanced Transfer Learning
A Google TechTalk, 2024-08-28, presented by Dariush Wahdany ML Privacy Seminar. ABSTRACT: Machine learning (ML) models have been shown to leak private information from their training datasets. Differential Privacy (DP), typically implemented through the differential private stochastic gradient descent algorithm (DP-SGD), has become the standard solution to bound leakage from the models. Despite recent improvements, DP-SGD-based approaches for private learning still usually struggle in the high privacy ($\varepsilon\leq 1$) and low data regimes, and when the private training datasets are imbalanced. To overcome these limitations, we propose Differentially Private Prototype Learning (DPPL) as a new paradigm for private transfer learning. DPPL leverages publicly pre-trained encoders to extract features from private data and generates DP prototypes that represent each private class in the embedding space and can be publicly released for inference. Since our DP prototypes can be obtained from only a few private training data points and without iterative noise addition, they offer high-utility predictions and strong privacy guarantees even under the notion of pure DP. We additionally show that privacy-utility trade-offs can be further improved when leveraging the public data beyond pre-training of the encoder: in particular, we can privately sample our DP prototypes from the publicly available data points used to train the encoder. Our experimental evaluation with four state-of-the-art encoders, four vision datasets, and under different data and imbalancedness regimes demonstrate DPPL's high performance under strong privacy guarantees in challenging private learning setups.

Privacy Amplification for Correlated-Noise Mechanisms via b-Min-Sep Subsampling

Yann LeCun: World Models: Enabling the next AI revolution

Privacy Auditing of Large Language Models

POPri: Private Federated Learning using Preference-Optimized Synthetic Data

Don't learn AI Agents without Learning these Fundamentals

Going Back and Beyond: Emerging (Old) Threats in LLM Privacy and Poisoning

Why Aliens Would NEVER Invade Africa

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Threat Models for Memorization: Privacy, Copyright, and Everything In-Between

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

Infantino stinksauer, leere Ränge, Buh-Rufe - und 200.000 Tickets übrig! RIP Fußball WM 2026

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Trump Preps for 80th Birthday, Threatens to Hit Iran, Knicks Historic Win & Elon Musk Trillionaire!?

I Gave ChatGPT a Body

Private Adaptations of Large Language Models

🚗 BYD : The biggest SCAM of the car industry ?
![Yann LeCun's $1B Bet Against LLMs [Part 1]](https://i.ytimg.com/vi/kYkIdXwW2AE/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDbV4izF3i-wxevCVIn7FJjoy1vlA)
Yann LeCun's $1B Bet Against LLMs [Part 1]

I Hacked This Temu Router. What I Found Should Be Illegal.

The AI Breakthrough That Will Change Everything (Google DeepMind CEO Interview)

