Open phofl opened 8 months ago
cc @rjzamora if you have thoughts where this belongs
It should raise once dask-expr actually passes the filters to pyarrow, but this won't happen until we need to know the number of partitions. So, dd.read_parquet(...).compute()
should raise. If this is not the case, then there is indeed a bug somewhere.
Just to clarify: We can still add an extra/earlier validation step to dask-expr. I'll take a look to see where that would be easiest.
Sorry, you are completely correct, it's raising when I call compute.
Ideally we would fail earlier, but this is by no means urgent, I'll change the title
Ideally, this would raise before we trigger compute