Distribution-Preserving Statistical Disclosure Limitation

Woodcock, Simon and Benedetto, Gary (2006): Distribution-Preserving Statistical Disclosure Limitation.

Preview

PDF
MPRA_paper_155.pdf
Download (1MB) | Preview

Abstract

One approach to limiting disclosure risk in public-use microdata is to release multiply-imputed, partially synthetic data sets. These are data on actual respondents, but with confidential data replaced by multiply-imputed synthetic values. A mis-specified imputation model can invalidate inferences because the distribution of synthetic data is completely determined by the model used to generate them. We present two practical methods of generating synthetic values when the imputer has only limited information about the true data generating process. One is applicable when the true likelihood is known up to a monotone transformation. The second requires only limited knowledge of the true likelihood, but nevertheless preserves the conditional distribution of the confidential data, up to sampling error, on arbitrary subdomains. Our method maximizes data utility and minimizes incremental disclosure risk up to posterior uncertainty in the imputation model and sampling error in the estimated transformation. We validate the approach with a simulation and application to a large linked employer-employee database.

Item Type:	MPRA Paper
Original Title:	Distribution-Preserving Statistical Disclosure Limitation
Language:	English
Keywords:	statistical disclosure limitation; confidentiality; privacy; multiple imputation; partially synthetic data
Subjects:	C - Mathematical and Quantitative Methods > C4 - Econometric and Statistical Methods: Special Topics C - Mathematical and Quantitative Methods > C8 - Data Collection and Data Estimation Methodology ; Computer Programs > C81 - Methodology for Collecting, Estimating, and Organizing Microeconomic Data ; Data Access
Item ID:	155
Depositing User:	Simon Woodcock
Date Deposited:	07 Oct 2006
Last Modified:	27 Sep 2019 05:05
URI:	https://mpra.ub.uni-muenchen.de/id/eprint/155

All papers reproduced by permission. Reproduction and distribution subject to the approval of the copyright owners.

View Item

Atom RSS 1.0 RSS 2.0

Contact us: mpra@ub.uni-muenchen.de

This repository has been built using EPrints software.

MPRA is a RePEc service hosted by .