parquet-tools Search Results

1000+ results for parquet-tools


duckdb/dbt-duckdb #193

Issue using `read_csv` in a model file for a large CSV being…

Hi @jwills ! Thank you for this amazing work!! I've used it to teach datathinking.org at University of Tartu and @PrincetonUniversity. However, I'm getting stuck in switching from a local CSV file…

jaanli updated 1 year ago
6
ironSource/parquetjs #76

Example usage of ParquetWriter.openStream?

I saw that @asmuth implemented it here: https://github.com/ironSource/parquetjs/blob/master/lib/writer.js#L52 But I'm wondering if someone has a simple example of how to use this. I've got it most…

aconanlai updated 7 months ago
8
apache/iceberg #10490

Custom s3 endpoint: Unable to execute HTTP request: Remote h…

### Apache Iceberg version 1.5.2 (latest release) ### Query engine None ### Please describe the bug 🐞 Hi, I am experimenting with setting up Iceberg locally and I am trying to connect to a cus…

samueljackson92 updated 5 months ago
2
duckdblabs/db-benchmark #32

Benchmark with Parquet

Is there any interest in using Parquet datasets to benchmark, particularly for the 50GB dataset case? Parquet is very common for large-scale data analytics, and as far as I know, most if not all of th…

srilman updated 1 year ago
7
apache/arrow #12118

[R] Error when reading parquet file using FileSystem object

Hello, I am encountering an issue when trying to read a parquet file using `read_parquet` with an `S3FileSystem` created with `s3_bucket()`. I created a worker that gets the last parquet file i…

everron updated 1 year ago
5
dask/dask-expr #1061

Reading a list of S3 parquet files with query planning enabl…

Was struggling to understand why creating a dask dataframe from a large list of parquet files was taking ages. Eventually tried disabling query planning and saw normal timing again. These are all rela…

b-phi updated 6 months ago
11
transferwise/pipelinewise #857

Fails to build on 2021 mac m1 pro

**Describe the bug** Running the make command on 2021 mac m1 pro fails with ` ERROR: Failed building wheel for pyarrow` **To Reproduce** Steps to reproduce the behavior: 1. Run the command `mak…

reubano updated 1 year ago
7
apache/arrow #31267

[Python] Allow serializing arbitrary Python objects to parqu…

I'm trying to serialize a pandas DataFrame containing custom objects to parquet. Here is some example code: `import pandas as pd import pyarrow as pa class Foo: pass df = pd.Dat…`

asfimport updated 2 years ago
9
sam-may/higgs_dna_tutorial.github.io #2

RFE: better LXPLUS-specific docs that do not rely on AFS?

(CERN AFS support here..) We occasionally have CMS users asking for more AFS space for running HiggsDNA. Could you perhaps document a "best practice" for your users at CERN that * does (as much as p…

jmuf updated 1 year ago
1
askap-vast/vast-pipeline #405

Remove arrow file generation when vaex 4.0.0 is available

vaex 4.0.0 includes the ability to open parquet files in an out of core context. This means that the arrow file will no longer be required when this is released. However from testing it is still be…

ajstewart updated 2 years ago
1
