-
The files in the bucket cannot be found anymore due to use of backslash instead of forward slash.
Temporary solution is to rename and move the files once they appear in the bucket (but the concepts w…
-
I am quite interested in using `LeRobotDataset` for large scale training. I am interested to get more context on the options for storing images so I am aware of the implications this might have:
- Di…
-
### Checks
- [X] I have checked that this issue has not already been reported.
- [X] I have confirmed this bug exists on the [latest version](https://pypi.org/project/polars/) of Polars.
### Reprodu…
-
I encountered a panic when processing parquet columns with logical_type=Null. This issue arises specifically when the library attempts to create a dictionary from a column with this logical type.
###…
-
**Describe the issue**:
While experimenting with `dd.read_parquet(..., filesystem="arrow")`, I noticed that I get a strange error whenever `distributed` hasn't been imported beforehand. I'm not sur…
-
# Python version
![DevVersion](https://github.com/user-attachments/assets/7513f040-e266-41de-ad21-4f983a1816b9)
# Code
## Traceback
![Parquet_export](https://github.com/user-attachments/assets/…
-
### Describe the enhancement requested
This part requires write arrow's StringView/BinaryView/LargeStringView/LargeBinaryView to Parquet file.
The Parquet library has the layers below:
1. arrow…
-
### How can we reproduce the crash?
Read sequentially many parquet files using DuckDB
### JavaScript/TypeScript code that reproduces the crash?
```shell
I cannot
```
### Relevant log output
```s…
-
### Bug description
Using the test method provided by @qqibrow https://github.com/facebookincubator/velox/issues/7478, four compression formats(GZIP, SNAPPY, LZO and UNCOMPRESSED) and two parquet …
-
**Describe the issue**:
It looks like `dask-expr` optimizes the query wrongly. Adding the extra `persist()` fixes the OOM memory:
```text
-----------------------------------------------------------…