-
Is it possible to write to parquet file using this library? (quickly checked the code, didn't see any write function).
-
Is this project maintained?
I guess it started as a fork of [parquetjs](https://github.com/ironSource/parquetjs) which was not really well maintained (and is still not).
But now it seems *parqu…
-
See https://huggingface.co/datasets/mikehemberger/inat_2021_train_mini_plantae
```
The size of the content of the first rows (358268 B) exceeds the maximum supported size (200000 B) even after tru…
-
Requires https://github.com/huggingface/datasets/issues/6438, to support GeoParquet. We could support more formats.
Possibly requires geopandas as a dependency.
-
https://huggingface.co/datasets/pufanyi/MIMICIT/tree/main/data/CGD/
CGD_instructions.json has lines with references like `"image_ids": ["CGD_IMG_0000000000069568", "CGD_IMG_000000328270"], "rel_ins_i…
-
See https://huggingface.co/datasets/imvladikon/hebrew_speech_coursera/discussions/1#6523d448b623a04e6c2f118a
>
> From the logs I see this error
>
> TooBigRows: Rows from parquet row groups a…
-
### Proposal Details
The following code exists in many projects, and developers actually want [0 1 2], but due to the initialization error of slice, the final result is [0 0 0 0 1 2]
```go
pa…
-
For example, https://datasets-server.huggingface.co/size?dataset=c4 only provides a global `partial: true` field and the response does not explicit that the "train" split is partial, while the "test" …
-
We currently return:
```
"sha256": "https://github.com/mlcommons/croissant/issues/80"
```
See https://github.com/mlcommons/croissant/issues/80. cc @marcenacp
-
Previously with an older release of dsq I could do a basic SQL select on a parquet file.
With the latest release (`0.2.0`), I get this error:
`panic: Missing type equality condition for unknown …