Differential Privacy for Social Science Inference
| Content Provider | Semantic Scholar |
|---|---|
| Author | D’Orazio, Vito; Honaker, James; King, Gary |
| Copyright Year | 2015 |
| Abstract | Social scientists often want to analyze data that contains sensitive personal information that must remain private. However, common techniques for data sharing that attempt to preserve privacy either bring great privacy risks or great loss of information. A long literature has shown that anonymization techniques for data releases are generally open to reidentification attacks. Aggregated information can reduce but not prevent this risk, while also reducing the utility of the data to researchers. Even publishing statistical estimates without releasing the data cannot guarantee that no sensitive personal information has been leaked. Differential Privacy, deriving from roots in cryptography, is one formal, mathematical conception of privacy preservation. It brings provable guarantees that any reported result does not reveal information about any one single individual. In this paper we detail the construction of a secure curator interface, by which researchers can have access to privatized statistical results from their queries without gaining any access to the underlying raw data. We introduce differential privacy and the construction of differentially private summary statistics. We then present new algorithms for releasing differentially private estimates of causal effects and the generation of differentially private covariance matrices from which any least squares regression may be estimated. We demonstrate the application of these methods through our curator interface. ∗ For discussions and comments we thank Natalie Carvalho, Vishesh Karwa, Jack Murtagh, Kobbi Nissim, Or Sheffet, Adam Smith, Salil Vadhan, and numerous other members of the “Privacy Tools for Sharing Research Data” project http://privacytools.seas.harvard.edu. This work was supported by the NSF (CNS-1237235), the Alfred P. Sloan Foundation and a Google gift. † Assistant Professor in the School of Economic, Political, and Policy Sciences at the University of Texas at Dallas. ‡ Senior Research Scientist, Institute for Quantitative Social Science, 1737 Cambridge Street, Cambridge, MA 02138 (jhonaker@iq.harvard.edu, http://hona.kr). § Albert J. Weatherhead III University Professor, Harvard University, Institute for Quantitative Social Science, 1737 Cambridge Street, Cambridge, MA 02138 (king@harvard.edu, http://GaryKing.org). |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://www.sas.rochester.edu/psc/polmeth/papers/Dorazio_Honaker_King.pdf |
| Language | English |
| Access Restriction | Open |
| Subject Keyword | Algorithm; Biologic Preservation; Biological Science Disciplines; Causal filter; Cryptography; Curator; Data anonymization; Differential privacy; Estimated; Hearing Loss, High-Frequency; IBM Notes; Inference; Interface Device Component; Jack Device Component; Jack Russell terrier dog breed; Least squares; Mathematics; Nephrogenic Systemic Fibrosis; Personally identifiable information; Plant Roots; Provable security |
| Content Type | Text |
| Resource Type | Article |
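
The abstract refers to "differentially private summary statistics" without showing what one looks like. The sketch below illustrates one standard construction, the Laplace mechanism applied to a bounded mean. It is not taken from the paper and should not be read as the authors' algorithm. The function names (`laplace_noise`, `dp_mean`), the clipping bounds, and the assumption that the sample size is public are all illustrative choices made for this example.

```python
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two i.i.d. exponentials."""
    rate = 1.0 / scale
    return rng.expovariate(rate) - rng.expovariate(rate)


def dp_mean(values, lower, upper, epsilon, rng=None):
    """Return an epsilon-differentially private mean (illustrative sketch).

    Assumptions made for this example, not taken from the paper:
    - every value is clipped to the analyst-supplied range [lower, upper];
    - the number of records n is treated as public;
    - neighboring datasets differ by replacing one record.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if lower >= upper:
        raise ValueError("lower must be strictly less than upper")
    if not values:
        raise ValueError("values must be non-empty")

    rng = rng or random.Random()
    n = len(values)

    # Clipping bounds each record's influence on the sum.
    clipped = [min(max(v, lower), upper) for v in values]
    clipped_mean = sum(clipped) / n

    # Replacing one record moves the clipped mean by at most (upper - lower) / n.
    sensitivity = (upper - lower) / n

    # Laplace mechanism: noise scale is sensitivity divided by epsilon.
    return clipped_mean + laplace_noise(sensitivity / epsilon, rng)


if __name__ == "__main__":
    ages = [23, 35, 41, 29, 62, 57, 38]
    print(dp_mean(ages, lower=18, upper=90, epsilon=0.5))
```

Smaller values of `epsilon` give stronger privacy and noisier answers. The sketch uses Python's general-purpose `random` module and ordinary floating-point arithmetic, both of which are known weak points for real deployments, so it is suitable only for building intuition.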