Closed wjones127 closed 1 year ago
This looks great! Really excited!
Some blog post ideas:
Let me know if I should make issues for the blog posts. I'm fine tracking them elsewhere too. I'll want delta-rs community reviews, but we can just do those in the Slack chat. Thanks for putting this together.
@MrPowers I'm interested in taking up Delta Lake + AWS Lambda blog post. Can you help me out with the process?
@wjones127 maybe a silly question but why would you still need the Operations API that only uses data fusion (in rust) after introducing the ADBC API?
From the design document I can see any query engine can potentially be used with ADBC.
Why implement optimize and zorder when databricks is going to the opposite side with Liquid Clustering. By the moment delta-rs implement this, databricks will have made Liquid Clustering the default.
Why implement optimize and zorder when databricks is going to the opposite side with Liquid Clustering. By the moment delta-rs implement this, databricks will have made Liquid Clustering the default.
But they are already implemented in delta-rs.
Why implement optimize and zorder when databricks is going to the opposite side with Liquid Clustering. By the moment delta-rs implement this, databricks will have made Liquid Clustering the default.
Delta-rs team actually implemented these two features before the announcement of delta 3.0 and liquid clustering. To be honest, delta 3.0 and liquid clustering came out kinds of unexpectedly
@rtyler
Do you plan to support Generated Columns
(Writer Version 4) in delta-rs ?
Work committed to
These are projects current contributors are working on.
Projects seeking contributors
In addition to smaller issues labelled good-first-issue, these are some larger projects that we could use some help on. Most of them will be implemented as part of the
operations
module in the Rust source and can later be exposed to Python and other bindings.DELETE
operation (#832)UPDATE
operation (#1126)MERGE
operation (#850)OPTIMIZE
operation, which currently only works on append-only tables (#1125)VACUUM
with bulk requests (#405, apache/arrow-rs#2615)