-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error in is.factor(values) && levels(values) == c("FALSE", "TRUE") when using readH5AD #8
Comments
I think that should work with the current master branch. Can you install the current version from Github with @gtca Any chance we can update the package in Bioc? |
You are right. I installed the latest developmental version and the down-sampled (and simplified) data can be loaded without any errors or warnings. It seems that the full dataset can be loaded too. However, although currently there is no error messages, this process is extremely slow. it has been running for about 20 min and has not finished at the time of writing :(. The full dataset has about 100k cells x 2k genes and can be loaded by scanpy in less than 1 min. |
It looks like it's spending all that time inside rhdf5's |
It looks like it's spending most of the time reading some fairly large compound datasets e.g. I'm actually surprised that the python implementation is so fast. Using the
Do the python and R implementations actually do the same things? |
The file structure can be examined in python: import scanpy as sc
fca_head = sc.read_h5ad('s_fca_biohub_head_10x.h5ad')
fca_head.uns['leiden_res14.0__rank_genes_groups']['names']
|
They should for most things, but I have to admit we haven't paid a lot of attention to compound datatypes (presumably the equivalent to |
I guess my question was really "does scanpy actually read whatever |
|
Interesting, I wonder how come it's so much more efficient in python. As you said earlier, they all ultimately use |
Dear developers,
I am trying to read an h5ad file into R with
MuData
and encountered the following error:This is my command:
I am not sure what's going wrong. This dataset can be loaded by
scanpy.read_h5ad
in python. I also tried to down-sample a subset of cells of this dataset in python and save a new h5ad. The down-sampled file cannot be read in R either.The file
s_fca_biohub_head_10x.h5ad
is available from Fly cell atlas: https://cloud.flycellatlas.org/index.php/s/LAEybPc2HZnpzKsThe text was updated successfully, but these errors were encountered: