-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement multitask training #25
Implement multitask training #25
Conversation
aa7178d
to
a3323e5
Compare
Codecov Report
@@ Coverage Diff @@
## master #25 +/- ##
=======================================
Coverage 80.94% 80.94%
=======================================
Files 17 17
Lines 1312 1312
=======================================
Hits 1062 1062
Misses 250 250 Continue to review full report at Codecov.
|
* Don't require an argument, just output the predictions on the validation set by default
e325e55
to
86817b2
Compare
* Remove confusing max_words parameter * Set max_len in init with default=250
86817b2
to
57e7588
Compare
Caused when laoding Rodrigues data with load_tsv
* Bumps parser model to 2020.3.8 * Bumps splitter model to 2020.3.6 - note that this model does not perform better than the model it replaces at present, but the previous model is not compatible with breaking API changes that have been implemented in the 2020.3.2 of deep_reference_parser. Nonetheless it will be relatively easy to experiment with the splitter model to get a higher score, and in any case this individual splitter model is mostly superseded by the multitask model, and just provided her for comparison.
keras_contrib.utils.load_save_utils is a soft wrapper around keras.saving which is not required in this relatively simple case.
@lizgzil I've made some progress on this today. I'm not sure what caused the mismatch in dims, but it seems to be resolved if using a different config/model ( I would be inclined to forget that model run, and to set the default to be Summary of what I have done today:
|
@ivyleavedtoadflax woop! great the mismatch issue isn't happening :) So do you think once this PR is closed, then I should train a model using the config for 2020.3.18 (i.e. use adam)? But for now I will just use the 2020.3.19 model if I need to. |
Yes, I would train a new model entirely. I've been doing some experimenting, and what is having more of an effect than anything is the sequence length. In this PR I added some logic to give us more fine grained control of sequence length both in data generation and in the model itself, and it had affected performance, so I think we might need to do a bit of experimentation to work out the best settings. But for now, yes I would just use a model that works, to get the logic working. |
The `2020.3.19_multitask` model will replace `2020.3.18` as the default model.
Add multitask 3.18 tsvs to datasets in Makefile
…llcometrust/deep_reference_parser into feature/ivyleavedtoadflax/multitask_2
This is a more reliable indicator that prediction size
NOTE: SplitParser functionality is not completely implemented, so these tests will need to be edited once it is.
Hey @lizgzil spent a little more time on this today:
I think now that tests are passing, I'd be inclined to merge this in even though the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good apart from that data/multitask
comment. I'm happy for you to merge this and then open the smaller PR
🤦 yep - will fix |
What this PR contains
split_parse
command that follows the same logic as thesplit
andparse
commands.load_tsv
more robust to quotes in input data.indices.pickle
. Note that this change may cause issues when attempting to load earlier (now outdated) model versions.split
,parse
andsplit_parse
commands.NOTE: This version includes changes to both the way that model artefacts are packaged and saved, and the way that data are laded and parsed from tsv files. This results in a significantly faster training time (c.14 hours -> c.0.5 hour), but older models will no longer be compatible. For compatibility you must use multitask models > 2020.3.19, splitting models > 2020.3.6, and parisng models > 2020.3.8. These models currently perform less well than previous versions (#27), but performance is expected to improve with experimentation predominantly around sequence length, and more annotated data.
How you can test it