yandex-research / tab-ddpm

[ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"
https://arxiv.org/abs/2209.15421
MIT License
397 stars 89 forks source link

How to tune SMOTE and CatBoost at the same time? #36

Open Minhvt34 opened 5 months ago

Minhvt34 commented 5 months ago

Hi, thank you for sharing your code.

I found that in tune_smote.py the metric metrics = train_catboost(...). I am wondering that we want to tune SMOTE to get better sampling data, however, the quality of sampled data is measured by the performance of catboost that means the catboost should have a tuned hyperparameters. As I understand, we can only tune SMOTE with a tuned CatBoost or we can tune CatBoost if we have a high quality sampling data from SMOTE. So if it is possible, for each SMOTE tuning iteration, we will get a data sample, then we will tune CatBoost.

for iteration (tune SMOTE):
      data = sample_smote
      for iteration (tune CatBoost):
           tune CatBoost

I cannot find where you deploy the tune_smote.py.

Best,