Defining and Enforcing Privacy in Data Publishing
Many organizations, such as the Census Bureau, hospitals, and search engine companies, wish to altruistically publish unaggregated data about individuals in order to support research. Such data usually contains personal information about those individuals. The challenge is to anonymize the data so that sensitive information about individuals is not disclosed while useful aggregate information is preserved. However, such altruistic data releases can lead to egregious leaks of personal information, as in the well-publicized AOL data release fiasco of August 2006. In the first part of this talk, I will motivate the need for formally defining privacy by showing attacks on a very popular anonymization technique called k-Anonymity. I will then present my work on L-Diversity, a formal definition of privacy that provably limits privacy breaches against bounded adversaries. In the second part of the talk, I will present some of the challenges I faced in applying formal privacy definitions to a real Census data publishing application called OnTheMap. I will also describe the techniques I developed to combat data sparsity and to ensure that OnTheMap published useful information. I will conclude by briefly describing a potential application of my work: a privacy-aware platform that may allow web applications to exploit personal data (search and browsing histories, social networks, tags, etc.) to enhance users' web experience while provably guaranteeing their privacy.
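To make the gap between k-Anonymity and L-Diversity concrete, here is a minimal Python sketch. It is not taken from the talk: the toy table, the column names, and the helper functions are all hypothetical, and the check implements only the simplest "distinct values" reading of L-Diversity, which may be weaker than the definition presented in the talk. It shows a table that is 2-anonymous on its quasi-identifiers yet still reveals the sensitive value for one group, because everyone in that group shares the same condition.

```python
from collections import defaultdict


def group_by_quasi_identifiers(rows, quasi_ids):
    """Group rows that share the same (generalized) quasi-identifier values."""
    groups = defaultdict(list)
    for row in rows:
        key = tuple(row[q] for q in quasi_ids)
        groups[key].append(row)
    return groups


def is_k_anonymous(rows, quasi_ids, k):
    """True if every quasi-identifier group contains at least k rows."""
    groups = group_by_quasi_identifiers(rows, quasi_ids)
    return all(len(g) >= k for g in groups.values())


def is_distinct_l_diverse(rows, quasi_ids, sensitive, l):
    """True if every group has at least l distinct sensitive values.

    Assumption: this is the simple 'distinct' variant, used here only
    for illustration.
    """
    groups = group_by_quasi_identifiers(rows, quasi_ids)
    return all(len({r[sensitive] for r in g}) >= l for g in groups.values())


# Hypothetical, made-up records with already-generalized quasi-identifiers.
table = [
    {"zip": "130**", "age": "<30", "condition": "heart disease"},
    {"zip": "130**", "age": "<30", "condition": "heart disease"},
    {"zip": "148**", "age": ">=40", "condition": "cancer"},
    {"zip": "148**", "age": ">=40", "condition": "flu"},
]
QI = ["zip", "age"]

print(is_k_anonymous(table, QI, 2))
print(is_distinct_l_diverse(table, QI, "condition", 2))
```

The first check passes and the second fails: an adversary who knows a target is under 30 and lives in a 130** ZIP code learns the condition without ever singling out a record, which is the kind of breach that k-Anonymity alone does not rule out.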