-
I noticed that querying a small number of Parquet files takes longer than I would expect. I think it's worth looking into where the time is being spent and building some tooling around doing profiling…
-
**Is your feature request related to a problem? Please describe.**
I am trying out daft as an alternative to spark, in our current use of spark we make use of the feature that even if dataframe is em…
-
### Describe the bug, including details regarding any error messages, version, and platform.
We experienced a segfault reading an encrypted Parquet file and traced this down to `EVP_DecryptUpdate` be…
-
### Description
Some parquet files may contain incorrectly calculated statistics (e.g. some of the ones written by older versions of polars containing UInt64 statistics had incorrect min/max). Beca…
-
xref: https://github.com/uscuni/simplification/pull/80#pullrequestreview-2140824961
Proposed structure:
```
-- data
| -- 1133
| | -- no_degree_2
| | | -- 1133.parquet
`…
-
### Description
### Acceptance Criteria
- [ ] [criteria 1]
- [ ] [criteria 2]
- [ ] [criteria 3]
### Additional Information
Child of Feature: https://github.com/eclipse-tractusx/…
-
Motivation:
One of the issue with parquet is that every column has to be read as a whole (contrary to a csv where an offset can be added and lines can be read individually) ... so if lets say ... I…
-
I was opening a Parquet file that contained structs but I'm not able to see or query the contents of those structs.
Since DuckDB does support these types, I was expecting them to work as well.
The…
-
https://github.com/NOAA-OWP/t-route/blob/5bef6342109fd09db27be731225e7c28e968061b/src/troute-network/troute/AbstractNetwork.py#L754
I have run into this in different scenarios, and typically end up…
-
I have some problems about parquet files when I run preprocessing codes are as follows:
(TimeVAE) aohuijie@ae0805e8ceb1:~/timeVAE/processing$ python preprocessors.py
Traceback (most recent call l…