Closed by johnynek 5 years ago
@ttim @dieu can you all take a look?
Nice tests. I'm excited to see whether switching the persist mode fixes the maxResultSize exceptions I'm seeing.
Thanks for the review, @stephbian! I'll publish an internal version tomorrow and we can see whether the persist changes are suitable for us.
This is based on some work with @stephbian.
We are attempting to use spark-scalding for an internal library that compiles to scalding, rather than making that library compile to spark.
We do three things:
I didn't discover any bugs, but the pattern probably wasn't clear to people. It would be nice to add more built-in types. I will try to make a follow-up using CSV/TSV, which requires a bit more work (since scalding and spark both need typeclasses to describe the types).