-
I noticed that querying a small number of Parquet files takes longer than I would expect. I think it's worth looking into where the time is being spent and building some tooling around profiling…
-
### Describe the enhancement requested
`pyarrow.dataset.write_dataset(compression='lz4_raw')` currently fails with:
```
Traceback (most recent call last):
File "/work/projects/lisa/testpyarrow…
```
-
### Checks
- [X] I have checked that this issue has not already been reported.
- [X] I have confirmed this bug exists on the [latest version](https://pypi.org/project/polars/) of Polars.
### Reprodu…
-
### Describe the bug
When a column has the `Dictionary` data type, the Parquet metadata statistics return `Exact(Dictionary(Int32, Utf8(NULL)))` for the min and max values.
### To Reproduce
Run the test…
-
Hi there,
I was using the MPC Hub and am now switching to the API. I'm working with the Sentinel-2 metadata Parquet, which is very useful for pre-filtering the tiles that I want to request later through th…
-
**Is your feature request related to a problem? Please describe.**
I am trying out Daft as an alternative to Spark. In our current use of Spark, we rely on the feature that even if a dataframe is em…
-
### Is your feature request related to a problem or challenge?
Part of https://github.com/apache/datafusion/issues/10922
We are adding APIs to efficiently convert the data stored in Parquet's "P…
-
**Describe the bug**
When I try to open a parquet file with the `perspective-parquet` viewer in my JupyterHub I get `Error. Perspective could not render the data`.
**To Reproduce**
- add `per…
-
### Describe the bug, including details regarding any error messages, version, and platform.
Fails with:
```
Cannot decrypt ColumnMetadata. FileDecryption is not setup correctly
```
This is u…
-
At present the only drawback we are facing is that ENTRADA2 does not store data from the DNS response message, such as the address that the DNS query resolves to. We would like to ask if this feature…