FINRAOS / DataGenerator

DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.
http://finraos.github.io/DataGenerator
Apache License 2.0
161 stars 170 forks source link

State space explosion? #301

Closed ptelang closed 8 years ago

ptelang commented 8 years ago

I am trying to generate dataset with about 50 variables out of which about 25 are categorical variables. I am using "file" tag for specifying possible values for the 25 categorical variables. The rest of the variables are specified using "assign" tag. Each file for the 25 categorical variables contains less than 10 values. DG framework chokes for more than 11 variables. It appears to freeze; I let it run for over 15mins and it produced nothing and caused my computer's fan to run faster than I have ever seen before! I am running it in Eclipse on a Mac with 16GB RAM and 2.5 GHz i7 processor.

Is there any solution for this issue?

mpeter28 commented 8 years ago

State space explosion is definitely an issue, and we have a guide (https://github.com/FINRAOS/DataGenerator/blob/master/docs/SCXMLModelOptimization.rst) on working around that. Hopefully that helps!

mibrahim commented 8 years ago

In case you were putting all those states in a single node, consider splitting them into several nodes. DG attempts to compute the product of all variables in the same node before moving along. Most likely, that's the reason it's choking.

Please re-open if we didn't answer your question. Thanks,