utdemir / distributed-dataset

A distributed data processing framework in Haskell.
BSD 3-Clause "New" or "Revised" License

Parquet Support #17

Open utdemir opened 5 years ago

utdemir commented 5 years ago

Parquet is a commonly used data format, but sadly the Haskell ecosystem lacks a mature library for it.

If we implement a library to encode and decode Parquet files, we can use it in this library, and it will also be a nice addition to the Haskell ecosystem.

There is one old, incomplete attempt at implementing a Parquet reader in Haskell: https://github.com/laurencer/parquet-haskell

utdemir commented 4 years ago

This is being worked on in PR #26.