The US Bureau of the Census (BOC) collects large quantities of data that can be useful for research and decision making by policymakers, businesses, and academics. The BOC is responsible for analyzing and publishing useful statistical data. As much of the collected data pertain to individuals, households, and establishments, the BOC also has a legal obligation to protect their privacy. It is difficult and labor-intensive to determine with confidence that these two requirements—privacy and utility—have been satisfied.
This research project seeks to further the use of formal approaches such as differential privacy that have the potential to provide rigorous guarantees that legal requirements for privacy and utility are met. Applying such approaches requires (a) bridging legal privacy requirements with mathematical privacy requirements, and (b) designing analysis methods that provide statistical utility while satisfying the privacy requirements from (a). Our team has developed tools satisfying these two goals in the context of Harvard’s Privacy Tools project, and we seek to collaborate with BOC staff to develop similar solutions that are tailored to the bureau’s specific requirements.
We address two major challenges confronting wider adoption of formal privacy models: (a) There is a wide conceptual and practical gap between the approaches found in formal privacy models and the heuristic approaches in current use and contemplated by existing regulatory and policy frameworks. (b) There is a gap between theoretical developments showing that formal privacy models like differential privacy permit, in principle, a wide collection of analyses and the actual use of analysis and publication techniques by the BOC. This project will result in methods for publishing data in ways that satisfy both formal mathematical privacy requirements and legal standards for privacy protection, thereby furthering “improvements to existing methods that protect privacy, avoiding the release of any information that would identify an individual or business in public statistics.”
This project is funded by a grant from the US Bureau of the Census.