Closed unoexperto closed 3 years ago
Thanks for the suggestion, I'll be adding zstd-jni-1.4.5-12.jar to the build over the next few days. It would be greatly appreciated if you were able to share a small sample file, but I know this isn't always possible.
I'm not having much luck getting this to work on Windows - the Avro and Parquet APIs are flimsy at the best of times and there's not much out there on Zstd support. I will take another stab this weekend, but for the time being I may just have to leave instructions for how to manually load the jar like you suggested.
I'm not going to be fixing this in the short term - I haven't been able to get it working on Windows. As I mentioned in my last message, the APIs are difficult enough to work with for basic functionality, never mind newer compression codecs such as this. I will update the documentation to let people know about your workaround, but without an external patch I won't be able to fix this.
@unoexperto please can you retry with just the latest version of the plugin and no other changes? There has been lots of work around ZSTD in parquet-mr:1.12.0, so I'm hoping it works now.
Using just the latest version of the plugin I was able to read a file generated via:
import pandas as pd
import pyarrow.parquet as pq
import pyarrow as pa

parquetFilename = "test.parquet"
df = pd.DataFrame(
    {
        "num_legs": [2, 4, 8, 0],
        "num_wings": [2, 0, 0, 0],
        "num_specimen_seen": [10, 2, 1, 8],
    },
    index=["falcon", "dog", "spider", "fish"],
)
df = pa.Table.from_pandas(df)
pq.write_table(df, parquetFilename, compression="zstd")
so I'll close this ticket; let me know if you're still experiencing issues.
@benwatson528 It works! Thank you!
I get the following exception
I fixed it by putting zstd-jni-1.4.4-7.jar into ~/.local/share/JetBrains/IntelliJIdea2020.2/intellij-avro-parquet-viewer/lib/ but it would be nice if it were bundled into the release build. Thanks a lot!
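For anyone who wants to script the workaround above, here is a minimal sketch in Python. The helper name install_jar is hypothetical, and it assumes the jar has already been downloaded to the current directory. The lib directory is the one quoted above and will differ by OS and IDE version.

```python
import shutil
from pathlib import Path


def install_jar(jar_path, plugin_lib_dir):
    """Copy a jar into the plugin's lib directory and return the new path.

    Hypothetical helper: it only copies the file, it does not download
    the jar or check that the plugin actually picks it up.
    """
    jar = Path(jar_path).expanduser()
    lib_dir = Path(plugin_lib_dir).expanduser()
    if not jar.is_file():
        raise FileNotFoundError(f"jar not found: {jar}")
    if not lib_dir.is_dir():
        raise NotADirectoryError(f"plugin lib directory not found: {lib_dir}")
    # copy2 keeps file metadata and returns the destination path
    return Path(shutil.copy2(jar, lib_dir))


if __name__ == "__main__":
    # Assumption: the jar sits in the current working directory.
    install_jar(
        "zstd-jni-1.4.4-7.jar",
        "~/.local/share/JetBrains/IntelliJIdea2020.2/intellij-avro-parquet-viewer/lib/",
    )
```

The IDE most likely needs a restart before the plugin sees the new jar, though that is an assumption rather than something confirmed in this thread.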