FINRAOS / DataGenerator

DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.
http://finraos.github.io/DataGenerator
Apache License 2.0
161 stars 170 forks source link

Implementation of DG in Spark; Separated code for dg-spark-example #267

Closed Brijeshrpatel9 closed 8 years ago

mibrahim commented 8 years ago

There are lots of ^M - seems to be windows EOL, that's why there's conflicts. I'll work on fixing those.

ghost commented 8 years ago

We learned in EtlMgmt, which uses github for its core scripts, that github is not configured to automatically remove carriage returns (which are represented as ^M). Svn can be configured to remove the unix-unfriendly characters which is why they had not previously been a problem.

mibrahim commented 8 years ago

I'll work on those, as well as instantiating the consumer inside the parallelize portion & some other cleanup that I see as well. Will send from my repo including @Brijeshrpatel9 additions and commits.

mibrahim commented 8 years ago

already merged through a different PR