tdspora / syngen

Open-source version of the TDspora synthetic data generation algorithm.
https://tdspora.ai/
GNU General Public License v3.0

Fix warning about dataset size #358

Closed tdspora closed 5 months ago

tdspora commented 5 months ago

If the number of rows in the dataset, or row_limit, is < 100, we should get the warning: "The input table is too small to provide any meaningful results. Please consider 1) disable drop_null argument, 2) provide bigger table".

If the number of rows in the dataset, or row_limit, is >= 100 and < 500, we should get the warning: "The amount of data is {len(data)} rows. It seems that it isn't enough to supply high-quality results. To improve the quality of generated data please consider any of the steps: 1) provide a bigger table, 2) disable drop_null argument".

However, we were getting these warnings only when row_limit was set. They should also appear when row_limit is not set and the dataset itself is small.
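
The intended check could look roughly like the sketch below. This is only an illustration, not the actual syngen code: the function names `effective_row_count` and `dataset_size_warning` are hypothetical, and taking the smaller of the dataset size and row_limit as the row count is an assumption about how the two values combine. The thresholds and message texts are the ones quoted above.

```python
from typing import Optional

# Thresholds taken from the issue description.
TOO_SMALL_THRESHOLD = 100
LOW_QUALITY_THRESHOLD = 500


def effective_row_count(total_rows: int, row_limit: Optional[int] = None) -> int:
    """Rows actually used for training (hypothetical helper).

    Assumption: when row_limit is set, at most row_limit rows are used;
    when it is not set, the whole dataset is used.
    """
    if row_limit is None:
        return total_rows
    return min(total_rows, row_limit)


def dataset_size_warning(total_rows: int, row_limit: Optional[int] = None) -> Optional[str]:
    """Return the warning text for a small dataset, or None if it is large enough.

    The check runs whether or not row_limit is provided, which is the
    behavior this issue asks for.
    """
    n_rows = effective_row_count(total_rows, row_limit)
    if n_rows < TOO_SMALL_THRESHOLD:
        return (
            "The input table is too small to provide any meaningful results. "
            "Please consider 1) disable drop_null argument, 2) provide bigger table"
        )
    if n_rows < LOW_QUALITY_THRESHOLD:
        return (
            f"The amount of data is {n_rows} rows. It seems that it isn't enough "
            "to supply high-quality results. To improve the quality of generated "
            "data please consider any of the steps: 1) provide a bigger table, "
            "2) disable drop_null argument"
        )
    return None
```

With this shape, `dataset_size_warning(50)` and `dataset_size_warning(1000, row_limit=50)` both yield the first warning, so the result no longer depends on row_limit being present.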