man-group / ArcticDB

ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
http://arcticdb.io
Other
1.46k stars 93 forks source link

Dedup broken with V2 encoding #1645

Closed alexowens90 closed 2 months ago

alexowens90 commented 3 months ago

Dedup with v2 encoding incorrectly matches data segments that are not identical. See test_arctic_batch.py::test_write_pickle_batch_dataerror for an example. Note that this predates #1317, should be retested after that is merged.

alexowens90 commented 2 months ago

Retested post #1317 merge and now works