-
We have an issue (https://github.com/NVIDIA/spark-rapids/issues/9058) to enable parquet writes in V2 format. We would also like to test the reader, and to test combinations of GPU/CPU encoding and decoding…
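A minimal sketch of the round-trip such a test implies, assuming plain PySpark: `parquet.writer.version` is the upstream parquet-mr Hadoop key that selects the V2 writer, and the output path here is hypothetical. In a GPU/CPU matrix, the plugin's `spark.rapids.sql.enabled` flag would presumably be toggled around the write and read halves.
```python
from pyspark.sql import SparkSession

# "spark.hadoop.*" settings are copied into the Hadoop Configuration,
# so this asks parquet-mr for the V2 writer (PARQUET_1_0 is the default).
spark = (
    SparkSession.builder
    .appName("parquet-v2-roundtrip")
    .config("spark.hadoop.parquet.writer.version", "PARQUET_2_0")
    .getOrCreate()
)

df = spark.range(1_000_000).selectExpr("id", "id % 7 AS bucket")
df.write.mode("overwrite").parquet("/tmp/parquet_v2_test")  # hypothetical path

# Read the V2-encoded files back and sanity-check the row count.
assert spark.read.parquet("/tmp/parquet_v2_test").count() == 1_000_000
```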
-
### Checks
- [X] I have checked that this issue has not already been reported.
- [X] I have confirmed this bug exists on the [latest version](https://pypi.org/project/polars/) of Polars.
### Reprodu…
-
To support loading large tables that might not fit into memory, it would be a good idea to add an option to `Table.get()` (or another method?) to read the data piece-wise.
Let us first create an …
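The example setup is cut off above, but as a rough sketch of the kind of piece-wise access being requested, this is how pyarrow streams a parquet file in bounded chunks today; the file and column names are made up.
```python
import pyarrow.parquet as pq

# Iterate over bounded record batches instead of loading the whole
# table; only one batch is resident in memory at a time.
pf = pq.ParquetFile("big_table.parquet")  # hypothetical file
total_rows = 0
for batch in pf.iter_batches(batch_size=64_000, columns=["id", "value"]):
    total_rows += batch.num_rows  # process each chunk here
print(total_rows)
```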
-
An R lesson showing how to use detection extracts stored in parquet files, and why parquet is a better format than CSV.
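The lesson itself would be written in R, but the usual argument translates directly; a small Python sketch, with a made-up detection extract, of the two standard points, type fidelity and column pruning:
```python
import pandas as pd

# Hypothetical detection extract: CSV forgets dtypes, parquet keeps them.
df = pd.DataFrame({
    "tag_id": pd.array([101, 102], dtype="int32"),
    "detected_at": pd.to_datetime(["2023-05-01T06:00", "2023-05-01T06:05"]),
})
df.to_csv("det.csv", index=False)
df.to_parquet("det.parquet")

print(pd.read_csv("det.csv").dtypes)          # detected_at comes back as object
print(pd.read_parquet("det.parquet").dtypes)  # datetime64[ns] preserved

# Parquet is columnar, so a single column can be read without the rest.
print(pd.read_parquet("det.parquet", columns=["tag_id"]))
```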
-
Tried filtering on a timestamp column using DuckDB against a Lance dataset, and it did not work. The same query worked against a parquet file.
```python
import pyarrow as pa
import lancedb
import du…
-
Hi,
I thought it might be a good idea to put the lazy Reference parquet files into git. Using this data directly from git does not seem to be possible - e.g. our GitLab server also does not allow byte-ran…
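For context, lazy parquet readers fetch only the footer and the column chunks a query touches, which requires HTTP Range support from the server. A quick probe, with a placeholder URL, to check whether a given server honors range requests:
```python
import requests

# Ask for just the first four bytes; a server that honors Range requests
# answers 206 Partial Content with the parquet magic bytes "PAR1".
url = "https://example.com/data/reference.parquet"  # hypothetical URL
resp = requests.get(url, headers={"Range": "bytes=0-3"})
if resp.status_code == 206 and resp.content == b"PAR1":
    print("byte ranges supported; lazy reads are possible")
else:
    print(f"no partial content (HTTP {resp.status_code}); "
          "readers would have to download the whole file")
```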
-
While trying to read a parquet file from the dataset revision refs/convert/parquet (generated by datasets-server) with duckdb, I get the following error:
```
D select * from 'https://huggingface.…
-
AWS has an option to export an RDS DB snapshot to parquet files in an S3 bucket, but the resulting files are gzipped.
Is it possible to load them directly with the fdw, or do I need to run a batch job …
-
Hello all.
I'm trying to export query results from a BigQuery table. Since the resulting table can be large (2.5 GB or more), I followed the "Larger datasets" suggestion from the `bq_table_do…
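(The quoted docs are for R's bigrquery; for what it's worth, the equivalent move in Python, sketched with hypothetical project and bucket names, is an extract job that writes sharded parquet to GCS rather than paging rows down through the API.)
```python
from google.cloud import bigquery

# Export a large table to sharded parquet files in GCS; the wildcard in
# the destination URI lets BigQuery split the output across many files.
client = bigquery.Client()
job_config = bigquery.ExtractJobConfig(
    destination_format=bigquery.DestinationFormat.PARQUET
)
extract = client.extract_table(
    "my-project.my_dataset.my_table",        # hypothetical table
    "gs://my-bucket/export/part-*.parquet",  # hypothetical bucket
    job_config=job_config,
)
extract.result()  # block until the export job finishes
```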
-
### Description
I'm currently using polars to perform ETL where the final destination is a data lake, and there's an incompatibility when working with LazyFrames that's causing significant perfo…
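The description is cut off above, so the exact incompatibility is unclear, but for the memory and performance side of LazyFrame ETL, a sketch of the streaming route polars offers (paths and column names invented):
```python
import polars as pl

# Keep the pipeline lazy end to end and stream the result to parquet
# instead of materializing the whole frame with .collect().
(
    pl.scan_parquet("landing/*.parquet")
    .filter(pl.col("amount") > 0)
    .with_columns((pl.col("amount") * 0.1).alias("fee"))
    .sink_parquet("curated/output.parquet")
)
```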