-
### Is your feature request related to a problem or challenge?
When scanning Parquet files, we'd often like to provide an expected schema, since:
1. The Parquet files might not all have an identic…
-
Currently, hubverse-transform infers the parquet schema to apply when converting incoming model-output data to parquet. Because each file arrives and is transformed as a single unit, pyarrow has a limi…
-
![image](https://github.com/google/fhir-data-pipes/assets/92530372/af47dbea-883b-442b-a283-dbc37aca4cd3)
-
### Checks
- [X] I have checked that this issue has not already been reported.
- [X] I have confirmed this bug exists on the [latest version](https://pypi.org/project/polars/) of Polars.
### Reprodu…
-
`--recursive / -r`: If a `.zip` file inside a `.zip` file is encountered, descend into it and add the files within it to the final parquet, instead of adding the `.zip` file itself. Should be …
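One way the recursive behavior could be sketched with the standard-library `zipfile` module (the helper name `iter_members` is hypothetical, not from any existing CLI):

```python
import io
import zipfile


def iter_members(zf, recursive=False):
    """Yield (name, data) for each file member; with recursive=True,
    descend into nested .zip members instead of yielding them whole."""
    for info in zf.infolist():
        if info.is_dir():
            continue
        data = zf.read(info)
        if recursive and info.filename.lower().endswith(".zip"):
            with zipfile.ZipFile(io.BytesIO(data)) as inner:
                yield from iter_members(inner, recursive=True)
        else:
            yield info.filename, data


# Build a zip-inside-a-zip in memory to show the difference.
inner_buf = io.BytesIO()
with zipfile.ZipFile(inner_buf, "w") as z:
    z.writestr("b.csv", "x\n1\n")
outer_buf = io.BytesIO()
with zipfile.ZipFile(outer_buf, "w") as z:
    z.writestr("a.csv", "x\n2\n")
    z.writestr("inner.zip", inner_buf.getvalue())

with zipfile.ZipFile(outer_buf) as z:
    flat = [name for name, _ in iter_members(z)]
    deep = [name for name, _ in iter_members(z, recursive=True)]

print(flat)  # ['a.csv', 'inner.zip']
print(deep)  # ['a.csv', 'b.csv']
```

The flat pass yields the nested archive as an opaque member; the recursive pass replaces it with its contents.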
-
### Modin version checks
- [x] I have checked that this issue has not already been reported.
- [X] I have confirmed this bug exists on the latest released version of Modin.
- [X] I have confirmed t…
-
Native worker not writing Parquet data files for WriterVersion v1 (PARQUET_1_0)
## Your Environment
* Presto version used: 0.288-SNAPSHOT
* Storage (HDFS/S3/GCS..): S3
* Prestissimo Setup on L…
-
### Description
Cluster: 1 coordinator, 3 workers
Trino version: 441
Connector: iceberg
Hello! I'm running a query to create a new iceberg table from an existing iceberg table. Something like th…
-
### Describe the enhancement requested
`pyarrow.dataset.write_dataset(compression='lz4_raw')` currently fails with:
```
Traceback (most recent call last):
  File "/work/projects/lisa/testpyarrow…
```
-
**Background**
We've previously identified non-uniform parquet schemas as a performance culprit when using `hubData` (because you have to do a `collect()` before you can filter). That issue is logged…