apache / datafusion

Apache DataFusion SQL Query Engine
https://datafusion.apache.org/
Apache License 2.0
6.37k stars 1.2k forks source link

Release DataFusion `42.0.0` #11902

Open alamb opened 3 months ago

alamb commented 3 months ago

Is your feature request related to a problem or challenge?

Tracking ticket for next release, also a place to track desired inclusions

Last release was https://crates.io/crates/datafusion/41.0.0 Aug 11, 2024 so next major release would be around September 10, 2024

List if items that would be good to get into this release:

Additional context

Items to fix before release

alamb commented 2 months ago

FYI this release is being planned for this week: https://lists.apache.org/thread/jgqktymcw8xtshth0lvr3317s77lft16

alamb commented 2 months ago

We (InfluxData) found two issues while upgrading to a pre-release version of DataFusion that are not yet filed. We are working to open tickets for them (after confirming they are indeed issues upstream)

alamb commented 2 months ago

In terms of using StringView by default, https://github.com/apache/datafusion/issues/11682, I hope to turn that on very soon after 42 is released (so it gets maximum bake time on main before we release version 43)

alamb commented 2 months ago

Here is one issue (a regression) that I think we should fix before release:

timsaucer commented 2 months ago

Another regression to fix before the release if possible: https://github.com/apache/datafusion/issues/12425

alamb commented 2 months ago

Another regression to fix before the release if possible: #12425

I added a section to the ticket's description for "items to fix"

andygrove commented 2 months ago

@alamb I plan on starting the release prep today unless there are other issues?

alamb commented 2 months ago

@alamb I plan on starting the release prep today unless there are other issues?

The only other thing I think we might want to consider is https://github.com/apache/datafusion/pull/12414 that @wiedld and @berkaysynnada are working on.

I have just filed https://github.com/apache/datafusion/issues/12446 to track it. However, since the regression has existed since 40.0.0 I don't think it should block the release of 42.0.0

alamb commented 2 months ago

@andygrove there are a few PRs I was holding off merging until you do the release cut

Do you have any estimate on when you might make the release candidate?

andygrove commented 2 months ago

I was waiting to see if https://github.com/apache/datafusion/issues/12446 would be resolved, but I can go ahead and cut the release candidate today

alamb commented 2 months ago

Release candidate thread: https://lists.apache.org/thread/bdnfy1skj5vs3s4nx44kycdd6lb917b2

I also filed https://github.com/apache/datafusion/issues/12470 to track releasing the next version 43.0.0

andygrove commented 2 months ago

Updates:

I have some post-release tasks to do: