| ||||
| ||||
![]() Title:Differentially Private Non Parametric Copulas: Generating synthetic data with non parametric copulas under privacy guarantees Authors:Pablo A. Osorio-Marulanda, John Esteban Castro Ramirez, Mikel Hernandez, Nicolas Moreno Reyes and Gorka Epelde Unanue Conference:IEEE CBMS 2025 Tags:categorical and numeric data, copula based models, Differential Privacy, differential privacy through an enhanced fourier perturbation, differentially private synthetic data, dp copula and dp histogram, dp histogram and dp copula, dp npc method, dp npc model, generate synthetic data, generating synthetic data, generation of synthetic data, hospital dataset, Non Parametric Copulas, privacy budget, privacy guarantees, privacy risk, privacy through an enhanced fourier perturbation method, private non parametric copula dp npc, Synthetic Data Generation, synthetic data generation model and synthetic data generation models Abstract: Creation of models to generate synthetic data has represented a significant advancement across diverse scientific fields, but this technology also brings important privacy considerations for users. This work focuses on enhancing a non-parametric copula-based synthetic data generation model, DP-NPC, by incorporating Differential Privacy through an Enhanced Fourier Perturbation method. The model generates synthetic data for mixed tabular databases while preserving privacy. We compare DP-NPC with three other models (PrivBayes, DP-Copula, and DP-Histogram) across three public datasets (income with sociodemographic information, criminal defendant’s likelihood data, and Hospital data considering utilization, charity, and admission data), evaluating privacy, utility, and execution time. DP-NPC outperforms others in modeling multivariate dependencies, maintaining privacy for small epsilon values, and reducing training times. However, limitations include the need to assess the model's performance with different encoding methods for categorical variables and consider additional privacy attacks. Future research should address these areas to enhance privacy-preserving synthetic data generation. Differentially Private Non Parametric Copulas: Generating synthetic data with non parametric copulas under privacy guarantees ![]() Differentially Private Non Parametric Copulas: Generating synthetic data with non parametric copulas under privacy guarantees | ||||
Copyright © 2002 – 2025 EasyChair |