Closed ryantwolf closed 5 months ago
@ayushdg could you please take a look at why the exact deduplication tests are failing here? I didn't modify anything related to dedup, but evidently one of my changes triggered this.
I haven't been able to reproduce this locally. At first glance, though, it looks like the dask config option for the shuffle method is no longer "tasks" by the time we get to the assert_eq. Overriding it in the test would allow things to pass, but I'm curious to see why it isn't reproducing locally.
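For reference, a rough sketch of what overriding the option inside the test could look like. This assumes the config key is `dataframe.shuffle.method` (older dask releases used a top-level `shuffle` key instead), and the helper names `current_shuffle_method` and `run_with_task_shuffle` are made up for illustration; they are not part of the test suite.

```python
import dask


def current_shuffle_method():
    # Read the shuffle method dask would use right now.
    # Assumption: the key is "dataframe.shuffle.method"; returns None if unset.
    return dask.config.get("dataframe.shuffle.method", default=None)


def run_with_task_shuffle(fn):
    # Hypothetical helper: call fn() with the shuffle method pinned to "tasks".
    # dask.config.set works as a context manager, so the previous value
    # is restored once the block exits.
    with dask.config.set({"dataframe.shuffle.method": "tasks"}):
        return fn()
```

Wrapping only the comparison step (the part that reaches assert_eq) keeps the override local to the test, so it would not hide whatever is changing the config elsewhere.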
Add functionality for large scale dataset blending and shuffling.