Medical imaging tutorial #172

moritzschwyzer · 2020-03-14T17:48:25Z

I created a medical imaging tutorial that shows how to work with X-ray DICOM data. For this matter, I updated the load_image function in vision.core to be able to load DICOM files using pydicom. For the tutorial, I created a small subset of the SIIM-ACR Pneumothorax Segmentation dataset with 250 DICOM files and a .csv file with labels. I informed the SIIM and they gave consent to create this subset and put it online. I'm currently hosting the .tgz file on a server of mine, but I think it would be a good idea to put it on a CDN together with the other datasets. I can forward to you the email with the permission statement from the SIIM to use their data if needed.

review-notebook-app · 2020-03-14T17:48:31Z

Check out this pull request on

You'll be able to see Jupyter notebook diff and discuss changes. Powered by ReviewNB.

sgugger · 2020-03-15T14:14:27Z

I would very much like vision.core to stay independent from pydicom to avoid the dependency on the main library (ultimately, fastai.medical will be a real submodule). In medical.imaging, you have a patch to Path objects to implement a dcmread method for opening them that could be useful.

We can host the tgz with our other datasets if the SIIM is okay with that.

moritzschwyzer · 2020-03-15T15:01:17Z

Hi Sylvain, yes I totally agree, it's much cleaner to keep it separately. I'd really like to use the vision functionality (data augmentation etc.), though. Do you have a recommendation how to integrate the DICOM loading in the most efficient way and still being able to use the vision functionality?

sgugger · 2020-03-15T15:13:26Z

I see you're using cls=PILImageBW in your data block. Why not create your own subclass of it, named DicomImageBW. You can have the code you used to load inside the .create method and it should work as is since you would still have the behavior of PILImageBW everywhere.

moritzschwyzer · 2020-03-15T19:29:02Z

Thanks a lot for your input, that was very helpful! I now created a subclass PILDicom based on PILBase. Additionally, I modified the dcmread function so that it can be used to in the PILDicom.create function. I made the according changes in the tutorial. How do you want to proceed with the small SIIM dataset? Will you download it from http://files.vedavimedical.com/siim_small.tgz and put it on your cdn?

review-notebook-app · 2020-03-15T19:38:09Z

View / edit / reply to this conversation on ReviewNB

sgugger commented on 2020-03-15T19:38:09Z
----------------------------------------------------------------

Could we hide the output of this cell by storing the predictions in some variable? There is no real point seeing it.

moritzschwyzer commented on 2020-03-15T19:42:54Z
----------------------------------------------------------------

Sure! I just pushed a new version.

sgugger · 2020-03-15T19:40:05Z

Thanks a lot! Two things: I think we can hide one output in your notebook. And the second is I'm not sure you should change the dcmread command as PIL does not deal with int16 images AFAIK. All of this should be in your PILDicom only.

moritzschwyzer · 2020-03-15T20:16:05Z

I just found this closed issue pytorch/vision#105 that states that PIL handles int16 grayscale images. I now reverted the dcmread function to the old version and use the PIL conversion in the PILDicom.

sgugger · 2020-03-15T21:52:34Z

Thanks for making the changes. This is looking pretty great :)

sgugger · 2020-03-16T09:21:08Z

One follow up: the image you put as an attachment in the notebook did not arrive on GitHub. Can you put it in the images directory in another PR? Thanks!
I'll deal with the dataset today and adjust the url when I've put it on out server.

sgugger · 2020-03-16T09:22:48Z

Oh it is there properly, just not working withour doc building. Sorry, no need to do anything, will fix manually :)

moritzschwyzer added 17 commits March 5, 2020 16:41

made dataloader work for dicom

3a6c566

tutorial working

9983ae5

made xray training work

04119df

Merge remote-tracking branch 'upstream/master'

7aab396

made tutorial work correctly

fa0370c

fastai2 update

10caa4d

Merge remote-tracking branch 'upstream/master'

351da99

added SIIM_SMALL dataset

53f64e4

Merge remote-tracking branch 'upstream/master'

45b4f6d

restored vision/core

3ebc63a

fixed ImageBlock import

0315e39

integrated SIIM dataset into medical tutorial

182b858

fastai2 update

e2a61c1

implemented DICOM import in vision.core

603c30a

added SIIM folder structure image and updated medical imaging tutorial

75bc9dc

completed medical imaging tutorial

42a8f34

cleaned repository for pull request

640fcb3

moritzschwyzer added 2 commits March 15, 2020 20:16

implemented PILDicom and updated medical tutorial

dc07ebc

removed pydicom from vision.core

cfc2e27

moritzschwyzer added 2 commits March 15, 2020 20:40

created variable to store tta output

1561190

reverted dcmread function

1e93727

sgugger merged commit 430479f into fastai:master Mar 15, 2020

moritzschwyzer deleted the medical_imaging_tutorial branch March 16, 2020 10:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Medical imaging tutorial #172

Medical imaging tutorial #172

moritzschwyzer commented Mar 14, 2020

review-notebook-app bot commented Mar 14, 2020

sgugger commented Mar 15, 2020

moritzschwyzer commented Mar 15, 2020

sgugger commented Mar 15, 2020

moritzschwyzer commented Mar 15, 2020

review-notebook-app bot commented Mar 15, 2020 •

edited

Loading

sgugger commented Mar 15, 2020

moritzschwyzer commented Mar 15, 2020

sgugger commented Mar 15, 2020

sgugger commented Mar 16, 2020

sgugger commented Mar 16, 2020

Medical imaging tutorial #172

Medical imaging tutorial #172

Conversation

moritzschwyzer commented Mar 14, 2020

review-notebook-app bot commented Mar 14, 2020

sgugger commented Mar 15, 2020

moritzschwyzer commented Mar 15, 2020

sgugger commented Mar 15, 2020

moritzschwyzer commented Mar 15, 2020

review-notebook-app bot commented Mar 15, 2020 • edited Loading

sgugger commented Mar 15, 2020

moritzschwyzer commented Mar 15, 2020

sgugger commented Mar 15, 2020

sgugger commented Mar 16, 2020

sgugger commented Mar 16, 2020

review-notebook-app bot commented Mar 15, 2020 •

edited

Loading