-
ROOT's new `RNTuple` columnar data storage is not going to support dynamic polymorphism (as opposed to `TTree`). One such use case in our data formats is `edm::OwnVector` that effectively behaves as `…
-
Previous discussion: https://github.com/apache/arrow-datafusion/issues/4707
Though the ORC format is not as widely used as parquet in arrow-rs and datafusion related projects, there are still some …
-
### What version of Materialize are you using?
708f88f6cb1459f0c3f85753386a7941c845a78f
### What is the issue?
I found this while looking into https://github.com/MaterializeInc/materialize/is…
-
23.1 must-haves:
- [x] introduce local fastpath
- [ ] figure out whether we want to support `Get` requests
- [ ] figure out what to do with tracing (i.e. `TraceKV` flag of `cFetcher`)
- [ ] what e…
-
# Context
All data sets used in **geobr** are currently stored in the format of [GeoPackage](https://www.geopackage.org/) `.gpkg` files. The choice for GeoPackage was an easy one. GeoPackage is a ve…
-
Even though the storage format has a [specification](https://automerge.org/automerge-binary-format-spec), it looks like the internals are only crate-visible. I'd be interested in having them `pub`, fo…
kim updated
10 months ago
-
If I understand Kotlin dataframes correctly, computations are done directly in the JVM.
For many use-cases, the optimized data storage and vectorized computations of DuckDB could be very useful in te…
-
### Describe the enhancement requested
The documentation for [Arrow Columnar Format](https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format) suggests that the separate Feather project has…
-
(Creating this issue for visibility so people interested can join the discussion... )
## Overview
Load Apache ORC formatted data natively into TensorFlow from file system supported by TensorFlow, e…
-
To be assigned to @benjeffery once he's a member of our org!