-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nightly Arrow tests are failing because of datafusion.substrait no longer being available #36
Comments
I'll take a look at this. I think I just need to enable the feature in the CI nightly build. |
The tests on this repo are also failing I see, but so I am talking about the tests included in Arrow, which is managed by https://github.com/apache/arrow/blob/main/ci/scripts/install_substrait_consumer.sh, and is essentially doing a simple So I think for this to work, there will need to be a new release of datafusion that adds it again to the wheel (or, as mentioned, short term pin to 32.0 until there is a new release) |
Sorry I can't seem to find that while grepping the codebase? Am I overlooking something? Can you point me to the code snippet performing that pip install datafusion? I just made a PR to Arrow DataFusion Python to include substrait in the nightly builds and if that command is pulling from the nightlies that should hopefully help out. I opened a PR apache/datafusion-python#544 where we can at least start to understand if that will help this issue or not and discuss there. |
Yeah, I said "essentially doing a pip install" because you will indeed not find that by grepping... It's in the file I linked above, it loops over the packages in the requirements.txt file in this repo, and then installs those. And datafusion is in there: consumer-testing/requirements.txt Line 1 in e71acfe
But anyway, fixing this on the datafusion-python side as you are doing with apache/datafusion-python#544 is the proper fix I assume. |
Ahh ok yeah. Sorry, I saw |
Thanks for pointing this out @jorisvandenbossche! I completely forgot that the arrow nightlies used the dependencies in this repo. I had seen this issue last week, but only did the pinning in CI here. I'll push the pinning in the requirements.txt too. |
The arrow nightly build with substrait integration tests is currently failing because of a datafusion import problem (latest logs):
I think this might be caused by apache/datafusion-python#527, which appeared in datafusion 33.0.
(short term workaround to get the tests green again might be to pin datafusion to 32.0)
The text was updated successfully, but these errors were encountered: