Closed burhanuddin-123 closed 3 weeks ago
Hi @burhanuddin-123 👋
Do you mind sharing more context with us so we can try to reproduce the issue on our end?
One thing I want to rule out is missing referential integrity, where all references in a foreign key reference a valid, existing primary key value. We created a function in our utils library to help process your data before model fitting. Try doing this step first before fitting and sampling. I doubt this is the issue since SDV usually checks for ref integrity, but still want to rule it out first.
Hi there @burhanuddin-123 are you still running into this issue?
Another user ran into a very similar issue and it seems to be related to the scale
parameter in their case. What value are you using for scale
when sampling from HMA Synthesizer?
We opened this new issue to track the bug with the proposed solution as well: https://github.com/sdv-dev/SDV/issues/2045
Hi there @burhanuddin-123 I haven't heard from you in a while so I'm going to go ahead and close this issue out. Please see the suggested workaround if you're still running into this issue: https://github.com/sdv-dev/SDV/issues/2045#issue-2334275417
Environment details
If you are already running SDV, please indicate the following details about the environment in which you are running it:
Problem description
I am looking to generate synthetic data at scale, for two tables (Customers, and Orders) having a relationship between them, where customers is a parent and orders as a child. After Validating the MultiTableMetadata and applying constraints, I was also able to fit the HMASynthesizer on real data.
But while generating the sample data, I am getting the following error:
I had tried to generate it multiple times, and each time I got different KeyError, such as
KeyError: 4
,KeyError: 7
, and so on. It is difficult to identify the root cause of this error.