FINRAOS / DataGenerator

DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.
http://finraos.github.io/DataGenerator
Apache License 2.0
161 stars 170 forks source link

Reduce number of maven dependencies for dg-core #306

Open sujitrect opened 8 years ago

sujitrect commented 8 years ago

I am a new user and thought that this small utility should have very less dependencies but was amazed to find that the maven dependency jars it(dg-core) has/downloads is a huge 135 jars. Naturally the initial setup took a very long time for me (as my mvn repo didnt have all these jars). Moreover, this is a bottleneck if i want to embed this in some high level project. Can you please evaluate to keep the bare minimum dependencies?

mibrahim commented 8 years ago

Agreed. We're looking into it.

dovidkopel commented 8 years ago

We will take out the Spark related dependencies and the classes that utilize them and make that into a sub-module off of dg-core.