-
Notifications
You must be signed in to change notification settings - Fork 336
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Swin UNETR Pretraining: HNSCC Data Extraction #189
Comments
I see now that each dataset is placed in its own directory ( |
Hi @coxjoseph , thanks so much for the question. How about this, we have a copy that are already converted to NIFTI, QAed and removed outliers. Thanks. |
Thanks for your great contribution. Hi @tangy5 , could you please provide more information on the TCIAcolon dataset as well? For example the mapping.json? I'm also having trouble to find the correct relation. |
I have a similar difficulty regarding reproducibility, and unfortunately, the link from tangy5 no longer works. I've downloaded the dataset HNSCC. Is this because the database was updated on request of the PI? Or did I miss something? Thank you very much in advance! |
When trying to pretrain the Swin Transformer model found in research-contributions/SwinUNETR/Pretrain/, I became aware of a discrepancy between the HNSCC json and the TCIA Colonography json.
The two json files downloaded from the links in the README (dataset_HNSCC_0.json and dataset_TCIAcolon_v2_0.json), while named correctly, both reference images in the
images
directory. At first I assumed that this just meant I had to somehow rename files from one dataset or renumber based on some ordering. Upon further inspection, however, the two files reference 602 of the same images in the same directory. Reading through the code, it does not seem that these images are handled any differently, leading me to believe that either one of the json files is linked incorrectly or the code is loading in multiple of the same images believing they are from different datasets. If the jsons are correct, could you please advise on how to rename/reorder the image files to correctly pretrain the model?Here's the short python script to validate that the two files are indeed reading the same images (place both json files in a subdirectory
jsons
relative to your working directory)> Found 602 paths in common.
The text was updated successfully, but these errors were encountered: