-
Currently building off of master is broken because of:
```
[ERROR] Failed to execute goal com.diffplug.spotless:spotless-maven-plugin:2.30.0:check (default) on project parquet-format-structures:…
```
-
https://arrow.apache.org/docs/python/filesystems.html
Like Kerchunk, it should be possible to create a virtual parquet dataset from an existing Zarr store. Here, it would also provide a table-like vi…
-
A tool that verifies encryption of Parquet files in a given folder. It analyzes the footer and then every module (page headers, pages, column indexes, bloom filters), making sure they are encrypted (i…
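A minimal sketch of the first step only (the footer check), not the proposed tool itself. It relies on the Parquet format's magic bytes: files with an encrypted footer start and end with `PARE`, while files with a plaintext footer use `PAR1`. The names `footer_mode` and `scan_folder` are hypothetical, and a plaintext-footer file may still contain encrypted columns, so the per-module checks described above would still be needed.

```python
import os

# Magic bytes defined by the Parquet format specification.
MAGIC_PLAINTEXT_FOOTER = b"PAR1"
MAGIC_ENCRYPTED_FOOTER = b"PARE"


def footer_mode(path):
    """Classify a file by the 4-byte magic at its start and end."""
    with open(path, "rb") as f:
        head = f.read(4)
        f.seek(0, os.SEEK_END)
        if f.tell() < 8:
            # Too short to hold both a leading and a trailing magic.
            return "not parquet"
        f.seek(-4, os.SEEK_END)
        tail = f.read(4)
    if head == tail == MAGIC_ENCRYPTED_FOOTER:
        return "encrypted footer"
    if head == tail == MAGIC_PLAINTEXT_FOOTER:
        return "plaintext footer"
    return "not parquet"


def scan_folder(folder):
    """Hypothetical helper: map each *.parquet file name to its footer mode."""
    return {
        name: footer_mode(os.path.join(folder, name))
        for name in sorted(os.listdir(folder))
        if name.endswith(".parquet")
    }
```

Calling `scan_folder("some/folder")` returns a dictionary such as `{"a.parquet": "encrypted footer"}`; any file reported as `plaintext footer` would need the deeper module-level inspection.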
-
If you see the error below in Anypoint Studio after adding the dependency to your pom.xml file, you need to edit the pom.xml for the audience-annotations file.
You can find that here: `/…
-
Analyze the size of Parquet files under different compression and chunk sizes... `parquet-tools inspect` shows negative compression ratios
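For orientation, here is a small arithmetic sketch of how a negative figure can arise. It assumes the reported ratio is the fraction of space saved, `1 - compressed / uncompressed`; that formula is an assumption and is not confirmed by the issue text. Under it, the value turns negative whenever the compressed chunk is larger than the raw data, which can happen for very small chunks where codec and page overhead dominate. The function name `space_saved` and the byte counts are made up for illustration.

```python
def space_saved(uncompressed_bytes, compressed_bytes):
    """Fraction of space saved by compression; negative if the data grew."""
    if uncompressed_bytes <= 0:
        raise ValueError("uncompressed size must be positive")
    return 1.0 - compressed_bytes / uncompressed_bytes


# Made-up sizes for illustration only (assumption, not real measurements).
samples = [("large column", 1_000_000, 250_000), ("tiny column", 40, 58)]
for label, raw, packed in samples:
    print(f"{label}: {space_saved(raw, packed):.0%}")
```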
-
I'm trying to read a file which contains ENUM types (e.g. `parquet-tools schema` shows something like `required binary entryMethod (ENUM);`), but `parquet.ParquetReader.openFile()` just throws…
-
Notes on how to use `parquet-tools` from Hadoop to inspect Parquet files.
https://stackoverflow.com/questions/36140264/inspect-parquet-from-command-line
I suspect we will end up using these a…
-
Parquet file import problem
The test file opens in other Parquet tools without any problems.
But in Zui, in the Format section, even though the Parquet file is selected, the imp…
-
### Describe the bug, including details regarding any error messages, version, and platform.
If I do
```
import polars as pl
import pyarrow.parquet as pq
df = pl.DataFrame({'a': [1, 2, 3]})
df.wr…
```
-
### Describe the enhancement requested
PyArrow's [documentation](https://arrow.apache.org/docs/python/parquet.html) describes how it handles reading and writing Parquet files:
> Apache Arrow is an id…