Closed by coastalwhite 1 week ago
It seems in general that projection pushdown with globs is broken. You don't even have to filter.
import polars as pl
pl.DataFrame({ 'a': [1, 2, 3], 'b': [4, 5, 6] }).write_parquet('1.parquet')
pl.DataFrame({ 'a': [1, 2, 3], 'b': [4, 5, 6] }).write_parquet('2.parquet')
pl.scan_parquet('*.parquet', row_index_name='idx').select(pl.col.a, pl.col.idx).collect()
This also panics.
Never mind, the problem was that another file with a different schema also got matched by the glob. Changing title.
This could be the same as https://github.com/pola-rs/polars/issues/13436
Checks
Reproducible example
Log output
Issue description
If two files matched by a glob have mismatched schemas, scanning them panics instead of raising a proper error.
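Until this surfaces as a proper error, one possible workaround is to compare the schemas of the globbed files before scanning. The sketch below is only an illustration: `find_schema_mismatches` is a hypothetical helper, not part of Polars, and the file names and dtype strings in the example are made up. It takes a mapping from file path to schema and reports every file whose schema differs from the first one, ignoring column order.

```python
from typing import Any, List, Mapping


def find_schema_mismatches(schemas: Mapping[str, Mapping[str, Any]]) -> List[str]:
    """Return the paths whose schema differs from the first file's schema.

    Hypothetical helper (not a Polars API). Schemas are compared as plain
    dicts, so column order is ignored; only names and dtypes matter.
    """
    items = list(schemas.items())
    if not items:
        return []
    # Use the first file's schema as the reference.
    _, reference = items[0]
    return [path for path, schema in items[1:] if dict(schema) != dict(reference)]


# Example with made-up schemas: a stray file with a different schema
# sits next to the two files from the reproduction.
schemas = {
    "1.parquet": {"a": "Int64", "b": "Int64"},
    "2.parquet": {"a": "Int64", "b": "Int64"},
    "other.parquet": {"a": "String"},
}
mismatched = find_schema_mismatches(schemas)
print(mismatched)
```

With Polars, the mapping could presumably be built from the same glob pattern, e.g. `{p: pl.read_parquet_schema(p) for p in sorted(glob.glob('*.parquet'))}`, and the scan skipped or narrowed when the returned list is non-empty.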
Expected behavior
No panic; a proper error should be raised instead.
Installed versions