Closed chrisvwn closed 6 years ago
You are completely right. Thanks for reporting. I've updated the package with a small fix to also allow to have only 1 split level. Although ffdfdply is intended to be used if you have a lot more split levels to get groups of split levels in RAM, indeed if you have only 1 split level, it now also works correctly. Thanks again.
When applying a function to an ffdf using ffdfdply, if the split parameter has only one level, the split level is returned as NA and seemingly all functions read the data as NA. I assume that ffdfdply should work with one split as well as multiple splits? For example:
returns the following
result
:This is easily checked by adding a second split in the zones e.g. changing the zoneVals entries to
zoneVals <- c(rep(1, 5000), rep(2, 5000))
gives the following result:Did I miss something in how ffdfdply works? I can calculate one split directly, sure, but when working with dynamic aggregation it makes sense for ffdfdply to handle multiple and single splits?