parquet Search Results - Githubissues

1000+ results
for parquet

Best match


duckdb/duckdb_mysql #100

COPY from Parquet seems to keep the Parquet file open (so th…

### What happens? After copying a Parquet file into a MySQL table, the Parquet file seems to be locked and therefore cannot be deleted until the process is killed. We use tmp Parquet files to bridge…

pronzato updated 4 days ago
1
apache/parquet-java #3067

Include FAPEC compressor support to Parquet?

### Describe the enhancement requested FAPEC is a high-performance data compression algorithm with many options, based on efficient entropy coding and including several pre-processing algorithms for …

PortellJ updated 4 days ago
1
astropy/astropy #17257

API inconsistency between parquet.votable and votable.parque…

For votable.parquet we have `column_metadata` while for parquet.votable we have `metadata`. Now, I have kept this inconsistency in #16375 as we have already run into the issue that the metadata we …

bsipocz updated 3 weeks ago
6
apache/arrow #44599

[Parquet] >2GiB Memory Leak on reading single parquet metada…

I have a ~1.5TiB, ~1.7k files parquet dataset with an additional `_metadata.parquet` file containing metadata of all row groups. The `_metadata` file was written with the mechanism described in the [d…

jonded94 updated 4 days ago
9
databendlabs/databend #16897

Feature: load parquet/ndjson support case sensitive

**Summary** Currently, when reading a Parquet file, the fields of the file schema are modified so that all field names are converted to lowercase. # Solution 1 parquet/ndjson add format option case_s…

youngsofun updated 1 day ago
2
apache/paimon #4568

[Bug] Caused by: java.lang.ClassCastException: java.lang.Byt…

### Search before asking - [X] I searched in the [issues](https://github.com/apache/paimon/issues) and found nothing similar. ### Paimon version 0.9 ### Compute Engine Spark ### Minimal reprodu…

ljingz updated 9 hours ago
2
lakehq/sail #301

Slow wide table left outer join on local machine

I've tried using sail for local development of Spark jobs, but running a simple query on a dataset of a few GBs makes sail slower than Spark. Without the join, the query runs within 10…

eredzik updated 1 day ago
4
Eventual-Inc/Daft #3329

Parquet reader support for RLE-encoded boolean columns

### Describe the bug Daft doesn't support some features of the Parquet file format for boolean columns. ### To Reproduce ``` import polars as pl import daft df = pl.DataFrame( {"a": [1, 2, …

uditrana updated 3 days ago
2
dask/dask-expr #1169

Predicate pull-up optimization

https://duckdb.org/2024/11/14/optimizers.html#filter-pull-up--filter-pushdown has a nice description of filter pull up, an optimization in DuckDB that I'd like to implement in dask-expr as a learning …

TomAugspurger updated 3 days ago
1
streamingfast/substreams-sink-files #9

High memory consumption using parquet encoder

Using the `feature/parquet` branch, I get really high memory usage from running this configuration: ``` substreams-sink-files run \ eos.substreams.pinax.network:443 \ https://github.com/pinax-netw…

coutug updated 1 week ago
2
