-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MAINTENANCE] Add extras to setup.cfg for optional deps #17
Conversation
4cc27fb
to
03302aa
Compare
03302aa
to
b92c91d
Compare
5bf0cee
to
8b68e7c
Compare
8b68e7c
to
c139d75
Compare
ruff==0.8.3 | ||
pytest==8.3.4 | ||
pytest-mock==3.14.0 | ||
great-expectations[spark, spark-connect]>=1.3.1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we really need spark
in lint
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good question. We do because spark dataframes are in the type signature of at least one of the operators. I don't see a way around it, but LMK if you do.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok...that leads me to ask the question: do we not support pandas for this operator?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We do support pandas. And that is part of the base gx install, so it's already included. I'll update the description on this PR, since these are good things to be clear about.
this looks great, thanks @tyler-hoffman . |
What this does
setup.cfg
Note on the mypy deps
Our type definitions for dataframes allow for both pandas and spark. The base gx install includes pandas, and the spark extras also need to be included for mypy to work.
What this doesn't do
CORE-730 says we should install the deps and perform a minimal proof that things work. I think the maintenance burden for a reasonable proof is sufficiently high to make this not worth the effort at the current time. We do have proof that the approach works, as the following extras are used by CI:
tests
,lint
,postgresql
,spark
.