mukunku / ParquetViewer

Simple Windows desktop application for viewing & querying Apache Parquet files
GNU General Public License v3.0
783 stars 98 forks source link

[BUG] Cannot open file because of missing column, but column is present. #70

Closed MCRE-BE closed 1 year ago

MCRE-BE commented 1 year ago

Parquet Viewer Version What version of Parquet Viewer are you experiencing the issue with? 2.5.1.0

Where was the parquet file created? Pandas Python - 1.5.3

Sample File Example.zip

Describe the bug Cannot open the file, but file can be opened in Python as the column flagged as missing is present. Likely because of a trailing "." (as that is the only mismatch between the file and the bugreport)

Screenshots If applicable, add screenshots to help explain your problem. image

mukunku commented 1 year ago

Looks like the underlying parquet.net library has an issue with column names ending in periods. I'm going to see if I can take a look to see if it's simple enough for me to fix. Maybe next week.

I'd normally open a bug ticket in that repo to see if someone else can get to it but I'm feeling a bit lazy 🙂 . If you're up for it opening a bug ticket there might significantly speed up getting a fix for this.

MCRE-BE commented 1 year ago

Top.

I'll open a bug report after my holidays 😇

MCRE-BE commented 1 year ago

@mukunku : fix will be pushed in 4.6.0 release of parquet.net

mukunku commented 1 year ago

Thanks so much for following up on this. I updated the parquet-dotnet library to v4.6.0 and can open your file now!

https://github.com/mukunku/ParquetViewer/releases/tag/v2.6.0.1

Closing out this ticket 🙌🏼