-
Hi @jwills ! Thank you for this amazing work!! I've used it to teach datathinking.org at University of Tartu and @PrincetonUniversity.
However, I'm getting stuck switching from a local CSV file…
-
I saw that @asmuth implemented it here: https://github.com/ironSource/parquetjs/blob/master/lib/writer.js#L52
But I'm wondering if someone has a simple example of how to use this.
I've got it most…
-
### Apache Iceberg version
1.5.2 (latest release)
### Query engine
None
### Please describe the bug 🐞
Hi,
I am experimenting with setting up Iceberg locally and I am trying to connect to a cus…
-
Is there any interest in using Parquet datasets to benchmark, particularly for the 50GB dataset case? Parquet is very common for large-scale data analytics, and as far as I know, most if not all of th…
-
Hello,
I am encountering an issue when trying to read a parquet file using `read_parquet` with an `S3FileSystem` created with `s3_bucket()`.
I created a worker that gets the last parquet file i…
-
Was struggling to understand why creating a dask dataframe from a large list of parquet files was taking ages. Eventually tried disabling query planning and saw normal timing again. These are all rela…
b-phi updated 6 months ago
-
**Describe the bug**
Running the make command on a 2021 Mac M1 Pro fails with `ERROR: Failed building wheel for pyarrow`
**To Reproduce**
Steps to reproduce the behavior:
1. Run the command `mak…
-
I'm trying to serialize a pandas DataFrame containing custom objects to parquet. Here is some example code:
```python
import pandas as pd
import pyarrow as pa

class Foo:
    pass

df = pd.Dat…
```
-
(CERN AFS support here..)
We occasionally have CMS users asking for more AFS space for running HiggsDNA. Could you perhaps document a "best practice" for your users at CERN that
* does (as much as p…
-
vaex 4.0.0 includes the ability to open Parquet files in an out-of-core context. This means that the Arrow file will no longer be required once this is released.
However, from testing it is still be…