Open gatesn opened 4 months ago
Capturing from slack:
Currently our tableprovider's pushdown is bottlenecked by take(varbin)
DataFusion defers to arrow's filter_bytes function to turn the predicate mask into new ArrayRef:
We want to have our own boolean builder to construct these masks, calculating run lengths, and using that to alternate between slicing/indexing in our implementation of take()
We should support converting between strict sorted integers and boolean masks. We may need an array type to go in both directions?
This could allow us to remove the RoaringUInt array