Open ColinFay opened 6 years ago
I like the idea!
Minor problem with `sample_n` is that it would not have the same semantics: it would return a sample of (number of chunks * n) rows instead of n, but if we document that I can live with that :-)
disk.frame has implemented a `sample_frac`, and `sample_n` is pending.
We could implement a chunk-wise `sample_n` / `sample_frac` with:
The sample would be done in each chunk that way.
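As a rough illustration of the chunk-wise idea, here is a minimal base R sketch. It assumes the chunks are held as a plain list of data frames; the helper names `sample_n_chunkwise` and `sample_frac_chunkwise` are hypothetical and not part of any package API.

```r
# Hypothetical helper: sample up to n rows from each chunk independently.
# `chunks` is assumed to be a plain list of data frames.
sample_n_chunkwise <- function(chunks, n) {
  sampled <- lapply(chunks, function(chunk) {
    # Cap at the chunk size so small chunks do not raise an error
    k <- min(n, nrow(chunk))
    chunk[sample.int(nrow(chunk), k), , drop = FALSE]
  })
  do.call(rbind, sampled)
}

# Hypothetical helper: sample a fraction of the rows of each chunk.
sample_frac_chunkwise <- function(chunks, frac) {
  sampled <- lapply(chunks, function(chunk) {
    k <- round(frac * nrow(chunk))
    chunk[sample.int(nrow(chunk), k), , drop = FALSE]
  })
  do.call(rbind, sampled)
}

# Toy data: iris split into 3 chunks of 50 rows each
chunks <- split(iris, iris$Species)
set.seed(1)
res_n    <- sample_n_chunkwise(chunks, 5)
res_frac <- sample_frac_chunkwise(chunks, 0.1)
nrow(res_n)     # 15, i.e. 3 chunks * 5 rows, not 5
nrow(res_frac)  # 15, i.e. 10% of each 50-row chunk
```

With this approach `sample_frac` keeps roughly its usual meaning across the whole dataset, while `sample_n` returns up to n rows per chunk, which is the semantic difference that would need documenting.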
What do you think about that? If it sounds like a good idea, let me know and I'll send you a PR.