sdv-dev / SDV

Synthetic data generation for tabular data
https://docs.sdv.dev/sdv
Other
2.31k stars 304 forks source link

Always create a sample size of 1 no matter how small the scale. #2062

Closed lajohn4747 closed 3 months ago

lajohn4747 commented 3 months ago

resolves #2045 CU-86b0reety

Make sure that the minimum size of root tables of 1 is used for sampling of small scales to avoid errors. Throw a warning that sample size should be bigger for better quality data as cardinality will likely no be accurate.

sdv-team commented 3 months ago

Task linked: CU-86b0reety SDV - HMA Synthesizer's scale parameter doesn't work for small values #2045

lajohn4747 commented 3 months ago

Can you verify that this doesn't break the _enforce_table_size method?

Added a unit test to show _enforce_table_size