hannes / miniparquet

Library to read a subset of Parquet files
Other
43 stars 7 forks source link

Add Python Support #10

Closed Mytherin closed 4 years ago

Mytherin commented 4 years ago

This PR adds Python support in the miniparquet module. Currently the module has a single function miniparquet.read(filename) that reads a miniparquet file and converts it into a dictionary of lists.

Several open issues still:

But then I only had 45 minutes to implement it :)

Usage:

import miniparquet

print(miniparquet.read('tests/data/alltypes_plain.parquet'))