-
Notifications
You must be signed in to change notification settings - Fork 160
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
initial support for Dask DataFrames in obsm/varm #1880
base: main
Are you sure you want to change the base?
Conversation
8a4761f
to
2c3b39e
Compare
2c3b39e
to
982f882
Compare
The minimum_versions test is failing due to an incompatibility between the old dask version and Python 3.11.9 specifically, not sure what to do here. |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #1880 +/- ##
==========================================
- Coverage 86.11% 83.60% -2.51%
==========================================
Files 40 40
Lines 6242 6258 +16
==========================================
- Hits 5375 5232 -143
- Misses 867 1026 +159
|
@ilia-kats Furthermore, we have reports from I had previously tried to do this, but there were several issues that made things quite unusable |
I like that the PR is basically write-only, but want to understand more. |
So my usecase is that I have several AnnData objects that I need to concatenate, and I want to do that as lazily as possible, without allocating memory, so what I'm doing is I'm converting everything in the AnnData objects to Dask equivalents and then running |
This was one of our stumbling blocks. It required a full pass over the data the last time we checked which kind of defeats the purpose. The above PR I linked to has full lazy concatenation features. |
That is good to know, I'll wait for that to be merged then, I suppose. |
This PR adds support for Dask DataFrames in
.obsm
/.varm
.