-
**Describe the problem you faced**
We encountered an issue with MOR table that utilizes metadata bloom filters and Parquet bloom filters, and has enabled statistics. When attempting to query data, …
bk-mz updated
4 months ago
-
I've upgraded to the latest version `0.0.9` and there are a few rough edges with reading parquet from Python.
## Problem scenario
If I have a parquet file with two columns, `id` and `name`, of typ…
-
## Feature Request
**Is your feature request related to a problem? Please describe:**
[parquet](https://parquet.apache.org/) is a compressed, efficient columnar data format. Lightning has alread…
-
### Describe the enhancement requested
Optimisation to https://github.com/apache/arrow/issues/37511
Child of https://github.com/apache/arrow/issues/18014
When reading from Azure blob storage the …
-
I got to know that arrow IPC files have significant performance benefits because the loading is much much faster and compression, while worse than parquet, is still pretty good.
The documentation s…
-
Would it make sense to be able to introduce support for `avro` schema for `TypedDataSet`?
The current code defines schema based on the `SparkSQL` "language": https://github.com/typelevel/frameless…
-
I can't use Parquet.jl because there is a problem reading Date-typed columns. They are reading into Julia DataFrames as an Int32 -- I'm pretty sure parquet files are supposed to define the schema and …
-
## Steps To Reproduce
After a recent update to chrome we get the following error on a page with dropdown filters. The same page worked in chrome before the update and it works in Firefox still. See…
-
## Bug
### Describe the problem
In docs https://docs.delta.io/latest/delta-update.html#performance-tuning we have spark.databricks.delta.merge.repartitionBeforeWrite.enabled
> This is enabled…
-
Hello!
I just came across this package. I'm aware that Go is generally very fast. Do you happen to know how the read speeds for Stata files with this package compare to Stata or Python? Is the read…