-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Release DataFusion 46.0.0
#14123
Comments
I think @xudong963 said they might be interested in running this one:
|
@alamb I'll also do some updates in the issue summary. Considering that this is the first time I've been involved in this process, could you please remind me if some critical time comes up and I don't do anything about it? |
Yes of course. Thank you @xudong963 I am planning to be involved in this one too -- perhaps I can help by testing the release e.g. with delta-rs and influxdb_iox |
@xudong963 when would you like to start making the release? Maybe we should targe the week of Feb 24 🤔 |
yes, the week is suitable. |
I've created a draft PR to upgrade Comet to use latest DataFusion: |
I noticed that older releases have not been removed from https://dist.apache.org/repos/dist/release/datafusion/. Perhaps this can be cleaned up as part of this release. |
@xudong963 are we still thinking of trying to get the release ready this upcoming week? I will try and focus my efforts starting tomorrow on ensuring the bugs listed in "Bugs that would be good to fix" are in |
@alamb In From experience, It seems that we should leave one week to test with |
I think it is a good idea. I'll try and make a PR to delta-rs as well Maybe @shehabgamin can help with testing SailHQ Looks like @andygrove has made a PR for testing with comet (linked above): apache/datafusion-comet#1423 |
I made a PR for testing in delta here: Still has some issues to work out |
I will test with Sail by Wednesday! |
I am running into one error in Comet: |
I'll open the changelog PR tomorrow |
Is there a reason why some
https://github.com/apache/datafusion/blob/main/Cargo.toml#L83-L95 |
Thanks @xudong963! I also pushed a note about the upgrade guide into the branch Would you like to try and make the release candidate now? datafusion/dev/release/README.md Lines 181 to 196 in 94d2baf
(I can do this too if you prefer) |
Yes, but need to wait a bit. I'm out now |
SOunds good -- let me know if you hit issues or want me to try. Thanks again for doing this! It will be great to have someone else with experience |
@alamb My pleasure I sent the release email to [email protected], did you see it? |
Yup! I am looking at it now |
I found an issue while validation: |
Update : the error with validation is related to a change in rustup, not the release candidate directly. I am working on a fix |
We have merged the fix here To simplify votiing I suggest we backport that fix to the Is this something you can do @xudong963 ? |
done @alamb |
I tested the upgrade on our system, and ran into bunch of runtime errors with Should ScalarUDFImpl::invoke_batch be marked as deprecated? (There is already a comment saying that in the docstring, but the compiler doesn't catch that 😅 ) That way we'd at least get some compile-time warning that we're doing something bad, now there was only runtime failures. |
Thanks @Blizzara ! I think we should mark it deprecated (though we had bad luck in the past with the compiler not complaining about implementing a deprecated trait method 🤔 ) We also tried to document this in the (new for the first time) upgrade guide: I think we may simply want to remove |
should use invoke_with_args instead See apache#14123 (comment)
Filed a PR here: #15049. Though I see there's already a release branch, how does that work?
Yea, in this case implementing the method is fine, since the way the method chain is setup means calling the newer methods automatically delegates to the older, deprecated methods if needed. However custom code calling those deprecated methods is not fine, and that'll be caught by properly marking the deprecation. The earlier case I noted around the implementing was for return_type stuff, where the chain isn't setup correctly (as it cannot really be as the args changed in an incompatible way).
Yup, that's where I originally saw this and wanted to test it, thanks! :) Still even with migration guide, people may not see or understand it, so avoiding silent runtime breaks is preferrable when ever possible (I'm sure we all agree on that, just saying it for completeness 😄 )
I'd +1 that, especially for the return_type things, but also for invokes. It'll be a compile break but that's better than any weirdness happening at runtime due to mismatching/wrong implementations. FWIW, after fixing all the invokes, all our tests seem to pass :) |
Realistically, I think what is on
Nice! |
@alamb i will catch a flight today and have a short vacation, do you have time to do the final release? |
Yes, absolutely -- thank you for all your help @xudong963 (I think a PMC member needs to do the final release anyways). |
The release was approved and published to crates.io See thread here: https://lists.apache.org/thread/rmvsc1ffpqd2z3m0c9lptjypzl83k2j8 I have filed a ticket for 47 I also have a small PR with cleanups from this release: Thanks again @xudong963 for running this release |
* fix: mark ScalarUDFImpl::invoke_batch as deprecated should use invoke_with_args instead See #14123 (comment) * fix deprecated usage that clippy warns about * fix another deprecated usage that clippy warns about * fix the rest of benches * fix two more implementations - now all that's left is in udf.rs * fix clippy * cleanup some leftover comments
I mostly just like to see changes in Github. We can merge this once Datafusion 46 released, assuming it includes the following PRs: - apache/datafusion#14754 - apache/datafusion#14671 - also hoping for apache/datafusion#14798 Seems like everything we wanted to get in made it, this branch now compiles AND passes all checks, so we're only waiting for the actual release in apache/datafusion#14123 --------- Co-authored-by: Alexander Droste <[email protected]>
Is your feature request related to a problem or challenge?
Tracking ticket for next release, also a place to track desired inclusions
Previous release will be https://crates.io/crates/datafusion/45.0.0 (likely Feb 1, 2025) December 31, 2024 so next major release would be around March 1, 2025
Steps:
47.0.0
(April 2025) #15072Prior release tickets:
44.0.0
Release DataFusion45.0.0
#14008 (comment)45.0.0
Release DataFusion45.0.0
#14008 (comment)Bugs that would be good to fix
DataSourceExec
#14905fix: fetch is missed during EnforceDistribution #14207select count(), count(*)
does not work #14855insert into
cases #14572Nice to have but non blockers
FileSource
-specific repartitioning #14754count_all()
expr_fn function now displayed ascount(1)
rather thancount(*)
#14894Needs some traige (should it be a blocker?)
TODO
Describe the solution you'd like
No response
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: