biolab / orange3

🍊 📊 💡 Orange: Interactive data analysis
https://orangedatamining.com

Read and Write Apache Parquet #6699

Open simonaubertbd opened 6 months ago

simonaubertbd commented 6 months ago

What's your use case? Apache Parquet ( https://parquet.apache.org/ ) is becoming more and more popular, and I think it is now effectively a standard in the data community; it is no longer restricted to Hadoop users. Qlik supports it, Alteryx will support it in the next release, even LibreOffice is working on it, and so on. Why? It is an open-source format, and it is fast.

What's your proposed solution? Have Orange Data Mining support reading and writing Apache Parquet files.

Are there any alternative solutions? Converting Parquet files to a supported format before loading them, but that seems pointless.

markotoplak commented 6 months ago

Makes sense indeed. Orange lacks a robust and fast file format.

When I need fast reading, I resort to pickled tables, but a robust format like Parquet would be a big improvement.
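
For readers unfamiliar with the pickling workaround, here is a minimal sketch of the idea using only Python's standard-library `pickle`. The `rows` list is a hypothetical stand-in for a loaded table so the snippet runs without Orange installed; it is not how Orange stores its data internally. Pickles are quick to load, but they are Python-specific, depend on the class definitions being available when loading, and should never be loaded from untrusted sources, which is roughly why they do not count as a robust interchange format.

```python
import pickle
import tempfile
from pathlib import Path

# Hypothetical stand-in for a loaded table: plain Python rows, used only
# so this sketch is self-contained.
rows = [("setosa", 5.1, 3.5), ("versicolor", 7.0, 3.2)]

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "table.pkl"

    # Write the object in binary form using the newest pickle protocol.
    with path.open("wb") as f:
        pickle.dump(rows, f, protocol=pickle.HIGHEST_PROTOCOL)

    # Read it back; only do this with files you created yourself.
    with path.open("rb") as f:
        restored = pickle.load(f)
```

The restored object compares equal to the original, but the file is only readable from Python, unlike a Parquet file, which other tools can open as well.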