-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
With learning rebase #52
base: master
Are you sure you want to change the base?
With learning rebase #52
Conversation
active learning with girder worker Show good_prob row Add active learning persisted state
@dzenanz This is more or less working for me, but I haven't really been able to test it since I don't know where to find a dataset with the new fields. Even if you import a file in the original csv format, I think there are going to be issues when it goes looking for the data in csv format in Girder, as I have not yet converted everything in the PR to work with JSON. In fact, I'm not sure that's even the right way to go for what it's doing. But I think the next step would be to put together a dataset in the format it expects and see where we start hitting errors. Once we see what's going on, we can decide if converting all internal expectations to JSON is what we want to do. Looks like it wants two new fields in the data (associated with each scan): But you can look at that expectation yourself if you check out the You can feel free to push more commits on this branch, or pull it to your fork and work there, up to you. |
@dzenanz As I'm working more with this, I see there are still issues I can fix without a dataset containing the image quality metrics. I'm working on that now, and will push another commit when I sort out the issues. |
The refactoring of some of the |
All of the learning modules seem to assume the data format is Thoughts @dzenanz @curtislisle? |
I don't have a strong preference. Do what you think is easier or better 😄 Or wait for Curt's opinion. |
I opted to write a converter from json back to csv. It seems to be working to pull the json representation out of girder, convert it to csv, and pass this into the learning component. I learned more about how this workflow runs as I worked through issues today. I tried to encapsulate the key steps to reproduce it in the Also, everytime I click the "RETRAIN" button in the application, I see the following errors in the celery output:
That seems like it may be caused by having an older
... which is obviously because I don't have all the important files in the mriqc master directory. |
Add a note indicating that the slash is needed at the end of the mriqc path (we should fix this, but it works this way for now). Also, I added the --no-sub argument when running mriqc so that we avoid submission of the computed metrics back to mriqc.nimh.nih.gov, at least while we are doing a lot of testing of our workflow.
: used in time separation (e.g. 16:10:20) is not allowed in file name on Windows.
This adds a radio group to the retrain dialog, allowing the user to select neural network or random forest classifaction. Also adds a placeholder for neural network learning in the tasks.py module. Also fixes the bug where the retrain dialog "Cancel" button does not work, and adds a note to dev doc on running "data2mriqc.py".
WARNING/MainProcess] m:\dev\zarr\miqa\mriqc\mriqc\data_loader.py:112: SettingWithCopyWarning: A value is trying to be set on a copy of a slice from a DataFrame See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy self.data['good_prob'][i] = predictions[ind]
NN classifier seems to be working on my computer. Can you check my work in #54? |
Thanks @dzenanz I'll try to take a look at this tomorrow. In the meantime I'm just curious, why create a new PR? |
I didn't want to impose on your branch. You can cherry pick commits from my branch. |
Rebase the original work adding learning to the application, and fix a couple console errors