hangxie / parquet-tools

Utility to deal with Parquet data
BSD 3-Clause "New" or "Revised" License
76 stars 11 forks source link

Feature: deal with PARGO_PREFIX_ #447

Open hangxie opened 4 days ago

hangxie commented 4 days ago

This is to address https://github.com/hangxie/parquet-tools/issues/446, which seems cannot be resolved by upstream (parquet-go) due to go's limitation.

Since PARGO_PREFIX_ is hardcoded in parquet-go, we can just trim it from all field names, there can be a CLI flag to disable this translation.

hangxie commented 1 day ago

schema command not supports option to remove the prefix https://github.com/hangxie/parquet-tools/pull/448

cat command is still pending as I believe I'd better refactor cat command to go with MarshalJSON route, I'm still reading (my own code! :sweat_smile:) to make sure this is the right approach.