ironSource / parquetjs

fully asynchronous, pure JavaScript implementation of the Parquet file format
MIT License
348 stars 175 forks source link

Any way to Read parquet records before appending to file #120

Open govthamreddy opened 3 years ago

govthamreddy commented 3 years ago

Is there anyway to read the parquet converted record before writing it into parquet file. My requirement is to create a parquet file directly into azure data lake without storing it locally.

dobesv commented 3 years ago

Using the parquets package I've been able to stream out parquet data without saving to a temporary file, it supports node streams. It will write out the data one row group at a time, though, so you do need enough memory to hold a full row group. This is a limitation of parquet.

vishald2509 commented 1 year ago

Using the parquets package I've been able to stream out parquet data without saving to a temporary file, it supports node streams. It will write out the data one row group at a time, though, so you do need enough memory to hold a full row group. This is a limitation of parquet.

can you please provide more insight into the code perspective? It will be very helpful.