-
Hello,
How can we view and extract the information from parquet files?
-
File "Show-o/parquet/refinedweb_dataset.py", line 20, in
from parquet.parquet_dataset import CruiseParquetDataset
ModuleNotFoundError: No module named 'parquet.parquet_dataset'
-
As a systems integrator, I want to be able to have increased control over writing parquet files so that I can implement a process for transforming data overnight.
This ticket needs more defin…
-
Would a PR to add `'application/vnd.apache.parquet'` here be welcome?
https://github.com/falconry/falcon/blob/b29fd5540ae58bed47198ea447f1e9194c34155c/falcon/constants.py#L137-L145
-
https://github.com/dask-contrib/dask-awkward/blob/ca257cade3e0a3bdd7d2607858561170cdfe21f0/src/dask_awkward/lib/io/parquet.py#L511
1. This option is required for status to round-trip through Parque…
-
### What happens?
Impala (CDH 7.1.9) has sometimes issues to read parquet files (which contain null values) generated by duckdb.
```bash
Parquet file '.../test.parquet': metadata is corrupt. Dicti…
-
Hello Enrico, is there an access to the parquet files generated by argo2parquet ?
-
This will help with remotely debugging and understanding the parquet file structure.
We can follow the similar API spec as duck_db: https://duckdb.org/docs/data/parquet/overview
- read_parquet
-…
-
### Apache Iceberg version
main (development)
### Query engine
None
### Please describe the bug 🐞
For nested struct types, when group.field.getId returns null, it causes iField to be nu…
-
Hi,
When tried to write a data.frame that has Posixct columns where some values are `NA`, `write_parquet` throws the following error:
```
> write_parquet(df, "my.parquet")
Error in write_parqu…