POPri: Private Federated Learning using Preference-Optimized Synthetic Data

A Google TechTalk, 2025-12-17, presented by Charlie Hou.
Privacy in Machine Learning Seminar.

ABSTRACT: In practical settings, differentially private federated learning (DP-FL) is the dominant method for training models from private, on-device client data. Recent work has suggested that DP-FL may be enhanced or outperformed by methods that use DP synthetic data (Wu et al., 2024; Hou et al., 2024). The primary algorithms for generating DP synthetic data for FL applications require careful prompt engineering based on public information and/or iterative private client feedback. Our key insight is that the private client feedback collected by prior DP synthetic data methods (Hou et al., 2024; Xie et al., 2024) can be viewed as a reinforcement learning (RL) reward. Our algorithm, Policy Optimization for Private Data (POPri), harnesses client feedback using policy optimization algorithms such as Direct Preference Optimization (DPO) to fine-tune LLMs to generate high-quality DP synthetic data. To evaluate POPri, we release LargeFedBench, a new federated text benchmark for uncontaminated LLM evaluations on federated client data. POPri substantially improves the utility of DP synthetic data relative to prior work on LargeFedBench datasets and an existing benchmark from Xie et al. (2024). POPri closes the gap between next-token prediction accuracy in the fully private and non-private settings by up to 58%, compared to 28% for prior synthetic data methods and 3% for state-of-the-art DP federated learning methods.

https://arxiv.org/abs/2504.16438
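To make the "client feedback as RL reward" idea concrete, here is a minimal Python sketch, not the authors' implementation. It assumes each synthetic sample has already received a privately aggregated (noisy) client score, turns those scores into one chosen/rejected pair, and evaluates the standard DPO loss for that pair. The function names, the top-versus-bottom pairing rule, and the value of `beta` are illustrative assumptions; the abstract does not specify how POPri forms its preference pairs.

```python
import math


def make_preference_pair(samples, noisy_scores):
    """Hypothetical pairing rule: the highest-scored sample is 'chosen',
    the lowest-scored sample is 'rejected'. The scores are assumed to be
    DP-aggregated client feedback computed elsewhere."""
    ranked = sorted(zip(noisy_scores, samples), key=lambda p: p[0], reverse=True)
    return ranked[0][1], ranked[-1][1]


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for a single preference pair:
    -log(sigmoid(beta * margin)), where the margin is the policy's
    log-ratio gain on the chosen sample minus its gain on the rejected one,
    both measured against a frozen reference model."""
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    x = beta * margin
    # -log(sigmoid(x)) = log(1 + exp(-x)), written in a numerically stable form.
    return max(-x, 0.0) + math.log1p(math.exp(-abs(x)))


if __name__ == "__main__":
    chosen, rejected = make_preference_pair(
        ["sample a", "sample b", "sample c"], [0.2, 0.9, 0.1]
    )
    print(chosen, "|", rejected)
    # Sequence log-probabilities below are made-up numbers for illustration.
    print(dpo_loss(-10.0, -12.0, -11.0, -11.0))
```

In a real pipeline the log-probabilities would come from the fine-tuned LLM and a frozen copy of it, and the loss would be averaged over many pairs per round.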
