-
Implement CLI binaries for working with ORC files, for example reading the schema or getting statistics.
Tools to have:
- View footer metadata
- Initial version: https://github.com/datafusion-contr…
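
As a rough illustration of the first step a "view footer metadata" tool has to take, the sketch below locates and decodes the ORC PostScript using only the Python standard library. It relies on the ORC file layout as described in the ORC specification: the file starts with the magic bytes `ORC`, the last byte holds the PostScript length, and the PostScript is an uncompressed protobuf message. The function names are hypothetical, and the field numbers mentioned in the comments (1 for footer length, 2 for compression kind, 8000 for magic) are assumptions that should be checked against the specification. This is not the implementation from the linked initial version.

```python
from typing import Dict, List, Tuple

ORC_MAGIC = b"ORC"


def read_varint(buf: bytes, pos: int) -> Tuple[int, int]:
    """Decode one protobuf base-128 varint at pos; return (value, next_pos)."""
    result = 0
    shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7


def parse_fields(buf: bytes) -> Dict[int, List[object]]:
    """Split a protobuf message into {field_number: [raw values]} without a schema."""
    fields: Dict[int, List[object]] = {}
    pos = 0
    while pos < len(buf):
        tag, pos = read_varint(buf, pos)
        number, wire_type = tag >> 3, tag & 0x7
        if wire_type == 0:  # varint
            value, pos = read_varint(buf, pos)
        elif wire_type == 2:  # length-delimited (bytes, strings, packed fields)
            length, pos = read_varint(buf, pos)
            value = buf[pos:pos + length]
            pos += length
        else:
            raise ValueError(f"unsupported wire type {wire_type}")
        fields.setdefault(number, []).append(value)
    return fields


def read_postscript(data: bytes) -> Dict[int, List[object]]:
    """Return the raw PostScript fields of an ORC file held in memory.

    Assumed field numbers: 1 = footer length, 2 = compression kind,
    8000 = magic string. Verify these against the ORC specification.
    """
    if not data.startswith(ORC_MAGIC):
        raise ValueError("missing ORC magic header")
    ps_len = data[-1]  # the last byte of the file is the PostScript length
    postscript = data[-1 - ps_len:-1]
    return parse_fields(postscript)
```

The footer itself sits directly before the PostScript and may be compressed, so a real tool would still need to decompress it according to the compression kind before decoding the schema and statistics.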
-
This write-up was prompted by [this zulip topic](https://neuroinformatics.zulipchat.com/#narrow/channel/406001-Movement/topic/Output.20formats).
## The problem
We have so far taken a pluralistic…
-
# `spatialdata` support in `segger`
Describing use cases and a possible strategy to enable `spatialdata` support.
## Use cases
These use cases can be considered as incremental goals, to accomplish …
-
write with csv
```go
func main() {
	var err error
	md := []string{
		"name=id, type=INT96",
		"name=name, type=BYTE_ARRAY",
	}
	fw, err := local.NewLocalFileWriter("csv.parquet")
	if err != n…
-
Many tools have the ability to read natively from blob storage; the operations on blob storage look similar to POSIX FS at a high layer, but at a low layer there is nuance that makes them quite differ…
-
I am unable to maintain it, and it has never been released. cc @elferherrera
-
### Apache Iceberg version
1.6.1 (latest release)
### Query engine
Hive
### Please describe the bug 🐞
Insert works fine, and the .parquet file is also generated in HDFS, and when I dec…
-
When using parquet-tools on a Parquet file with null records, the null columns are omitted from the output.
Example:
```scala
scala> case class Foo(a: Int, b: String)
defined class Foo
sc…
-
`parquet-tools merge` is extremely time- and memory-consuming when used with the block-option.
The merge function builds a bigger file out of several smaller parquet-files. Used without the block-op…
-
After following the instructions here for running the parquet-tools project, I am getting the following error:
`[ERROR] /Users/jecihjoy/Downloads/parquet-mr-apache-parquet-1.11.1/parquet-tools/src/m…