gianlucatruda / TableDiffusion

The first differentially-private diffusion model for tabular data
https://arxiv.org/abs/2308.14784
GNU General Public License v3.0
15 stars 2 forks source link
differential-privacy diffusion-models generative-adversarial-network generative-ai pytorch tabular-data

TableDiffusion

arxivbadge githubbadge blogbadge

This is the supporting code for the paper Generating tabular datasets under differential privacy.

Please check out a quick overview on my blog.

TableDiffusion is a project focused on providing differentially-private generative models for sensitive tabular data. The goal is to enable the synthesis of data that maintains the statistical properties of the original dataset while ensuring the privacy of individuals' information.

The most notable model from this work is TableDiffusion, the first differentially-private diffusion model for tabular data. See tablediffusion/models/table_diffusion.py

:warning: Disclaimer: This codebase is intended for research purposes only and is not ready for production use. The current implementation may not preserve privacy guarantees due to seed and sampler settings that are not suitable for a production environment.

Paper explanation on YouTube

https://youtu.be/2QRrGWoXOb4

Paper presentation on YouTube

Citing this work

Truda, Gianluca. "Generating tabular datasets under differential privacy." arXiv preprint arXiv:2308.14784 (2023).

@article{truda2023generating,
  title={Generating tabular datasets under differential privacy},
  author={Truda, Gianluca},
  journal={arXiv preprint arXiv:2308.14784},
  year={2023}
}