mukunku / ParquetViewer

Simple Windows desktop application for viewing & querying Apache Parquet files
GNU General Public License v3.0
689 stars 82 forks source link

Add support for List type fields #33

Closed emanueol closed 3 years ago

emanueol commented 3 years ago

Parquet Viewer Version 2.3.0.40676

Where was the parquet file created? AWS EMR > Apache Spark

Sample File cant do, its 22MB with sensitive information and i dont have control of producer.

Describe the bug Parquet contains a field which is a nested table (array of strings).. so I guess ParquetViewer is unable to open these type of parquets. I cant share parquet data file, but im attaching detailed error popup screenshot - basically its refering to the nested field.

Screenshots parquet-error-nestedtable parquet-error-nestedtable2

Additional context

Note: This tool relies on the parquet-dotnet library for all the actual Parquet processing. So any issues where that library cannot process a parquet file will not be addressed by us. Please open a ticket on that library's repo to address such issues.

mukunku commented 3 years ago

Support for List types has been added as of v2.4 (nested lists still not supported though): image

Let me know how it works for your file. If you experience any issues please comment on #37

Here are some sample files with list values: