yandex-research / tab-ddpm

[ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"
https://arxiv.org/abs/2209.15421
MIT License
397 stars 89 forks source link

Question about tune evaluation models #26

Closed zealscott closed 1 year ago

zealscott commented 1 year ago

Hi,

Thanks for your wonderfull paper and code! I have a small question about the evulation process:

In my understanding, tuned_models/ stores the tuned parameters for each evaluation model (CatBoost or MLP) on real data, and when we tune a new synthesis method like Tab-DDPM, we just use tuned parameters to evaluate the synthesized data? Is that right?

If so, I am wondering whether it is possible that for synthesized data, there are better parameters that can reach the better performance on CatBoost or MLP?

Thanks and looking forward to your rely!

rotot0 commented 1 year ago

Hi,

Yes, you are right about tuned parameters. As for tuning on synthesized data, we tried that and there was no substantial difference.

Thanks for your question!