Open jaehyeon-kim opened 5 hours ago
Hello @jaehyeon-kim,
I believe that the bucket transform uses the hash of the id
to bucket the data. So the only predicate we can push down in this case is equality. The issue is that id <= 3
can't be pushed down due to the less than component.
Can you try the following query with an equal instead?
iceberg_table1 = catalog.load_table("demo.sample1")
df1 = daft.read_iceberg(iceberg_table1).where(col("id") == 3)
df1.explain(show_all=True)
Describe the bug
I have an iceberg table that is partitioned by three columns. The first column (id) is bucketized.
When I filter by the partition column (id), I see a warning that indicates no partition filter is specified. If the column is not bucketized, I don't see such a warning.
To Reproduce
The iceberg table is created as follows.
Expected behavior
The filter predicate should be passed into partition filters.
Component(s)
Other
Additional context
No response