-
### Description
when loading parquet files, it would be nice if polars could trace which file each row came from
something like
```python
df = pl.read_parquet('*.parquet', with_source_column=Tru…
-
**Describe the bug**
I successfully created a .hyper, but in tableau, it is unable to connect to it. Version 2023.1.0
"""
Unable to connect to the server. Check that the server is running and tha…
-
Our existing Parquet reading code does not support v2 pages:
* `io.deephaven.parquet.ColumnPageReaderImpl#readPageV2` throws `UnsupportedOperationException`
* `io.deephaven.parquet.ColumnPageReaderI…
-
### What happens?
`duckdb` and arrow seem to write parquet files at roughly the same speed until the data gets to about 10+ GB, at which point duckdb is about an order of magnitude slower.
The i…
-
i cant view a parquet file.
line below throws error -> "don't know how to skip type Set"
var parquetReader = await ParquetReader.CreateAsync(parquetFilePath, null, cancellationToken);
The error m…
-
### Describe the enhancement requested
We want to start using fp16 data for our Ml workflows. We hoped for disk space savings, reduced RAM consumption, and doubled reading performance. Parquet files …
-
### Apache Iceberg version
1.4.3 (latest release)
### Query engine
Spark
### Please describe the bug 🐞
I have an Iceberg table, and I want to create two bloom filters on a root string column and …
-
### Is there an existing issue for this?
- [ ] I have searched the existing issues
- [ ] I have checked [#657](https://github.com/microsoft/graphrag/issues/657) to validate if my issue is covered by …
-
### Is your feature request related to a problem or challenge?
We have several forms of predicate pushdown in DataFusion's Parquet reader. The code path taken depends on the exact data layout and pre…
alamb updated
1 month ago
-
I noticed a difference in fsspec's handling of folders containing parquet files:
Call method:` pd.read_parquet ("s3://xxx/test_dir/")`
Normally, if there is a parquet file under the test_dir, this m…