Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Expected Behavior
When generating array valued column generation spec, use different random seed for each element
Current Behavior
When generating multiple values for array elements, current default random seed produces same value for each array element:
For example:
Workaround
Add randomSeed option of -1 to array valued column - however the data generation is then not-repeatable.
Context
Your Environment
dbldatagen
version used: