Team-TUD / CTAB-GAN

Official git for "CTAB-GAN: Effective Table Data Synthesizing"
Apache License 2.0

Questions about synthetic data #22

Closed by limhasic 10 months ago

limhasic commented 10 months ago

When the original data contains highly correlated columns and problem_type is set to regression or classification, is it meaningful to synthesize the columns one by one?

zhao-zilong commented 10 months ago

Hi @limhasic

No. Whether or not you specify problem_type, the synthetic data is generated row by row, and all columns of a row are generated together.
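To illustrate why generating all columns of a row together matters for correlated columns, here is a minimal sketch in plain Python. It is not CTAB-GAN code: the toy table, the `pearson` helper, and the use of simple resampling as a stand-in for a trained generator are all assumptions made for illustration only.

```python
import random

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5

rng = random.Random(0)

# Hypothetical "real" table with two strongly correlated columns (a, b).
real = []
for _ in range(2000):
    a = rng.gauss(0, 1)
    b = 2 * a + rng.gauss(0, 0.1)
    real.append((a, b))

# Row by row: every synthetic row is drawn as a whole, so a and b stay paired.
# (Resampling whole rows stands in for a generator that emits complete rows.)
joint = [rng.choice(real) for _ in range(2000)]

# Column by column: each column is drawn independently from its own marginal,
# which discards the relationship between a and b.
col_a = [rng.choice(real)[0] for _ in range(2000)]
col_b = [rng.choice(real)[1] for _ in range(2000)]

corr_real = pearson(*zip(*real))
corr_joint = pearson(*zip(*joint))
corr_indep = pearson(col_a, col_b)

print(f"real: {corr_real:.3f}  row-wise: {corr_joint:.3f}  column-wise: {corr_indep:.3f}")
```

In this toy setup the row-wise samples keep the correlation of the original table, while the column-wise samples lose it almost entirely, even though each column's marginal distribution still looks correct.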