apache / iceberg

Apache Iceberg
https://iceberg.apache.org/
Apache License 2.0
6.25k stars 2.18k forks source link

How can I quickly insert data into an iceberg table in a Python environment? #8801

Open 1taoze opened 12 months ago

1taoze commented 12 months ago

Query engine

Environment: Presto + Hive-Metastore + Minio

Question

Is it too slow to import data into an iceberg table that exists in minio through presto. Is there any fast way to import data?

amogh-jahagirdar commented 12 months ago

I guess there are a few different topics here:

Is it too slow to import data into an iceberg table that exists in minio through presto.

Slow or fast depends on your requirements, your ingestion pattern, table layout etc :) If you could provide more details that would be helpful. I would say, I don't know about Presto's support for Iceberg, but Trino has a actively maintained Iceberg connector.

How can I quickly insert data into an iceberg table in a Python environment?

Python write support is actively being worked on, cc @Fokko , you can follow the progress https://github.com/apache/iceberg-python , contributions/feedback always welcome!

github-actions[bot] commented 2 weeks ago

This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.