-
#### Problem description
I'd be curious to get opinions on whether serialization/deserialization should be supported for the file-like objects at the core of this library. This would be useful for …
-
We are cleaning up the CKAN core CLI (https://github.com/ckan/ckan/issues/4639) and there is the `rdf-export` command that dumps all datasets in the site as RDF in a folder.
The implementation isn'…
-
When collecting a CC/FIPS dataset, we could collect the tool versions used to produce such datasets. When the user fetches `from_web_latest()`, we could display a warning when different version is run…
-
When resolving transformers (#5657) in universal mode, we resolve datasets v2.14.4, v2.20.0 and fsspec v2024.5.0, v2024.6.1 while datasets v2.20.0 and fsspec v2024.5.0 would work for both branches in …
-
I have a workflow for detecting duplicates in a dataset that I am looking to deploy.
I'd like to be able to configure the blocking rules used to generate candidate pairs as part of the CLI that run…
-
DataSets may be impacted by BinaryFormatter. We need to investigate serialization/deserialization flow for DataSets, understand what impact BinaryFormatter might have, and determine next steps from th…
-
This is a major feature release.
Spec: https://github.com/MadcowD/ell/blob/cd64ab9bb0d3a09195fef7a32ef77ac5d7e6c912/docs/ramblings/evalspec.md
Ramblings: https://github.com/MadcowD/ell/blob/cd64ab9…
-
### Bug description
When exporting certain datasets, the serialized version of the dataset is invalid.
This causes validation with marshmallow to fail when you try to import it again.
### How…
-
I'm not super happy with the `vds.virtualize.to_` accessor name. It's confusing - really the "virtualize" accessor doesn't "virtualize" anything, the references are already virtual, it just allows you…
-
I would like to be able to read virtual references back out from an icechunk store into VirtualiZarr `ManifestArray` objects.
Note this issue is the icechunk equivalent of https://github.com/zarr-d…