Large variance in convergence between T5 and T5.1 #960

The released checkpoints page lists some differences between the versions:
https://github.com/google-research/text-to-text-transfer-transformer/blob/main/released_checkpoints.md
It also says that dropout, which was turned off during pre-training, should be re-enabled when fine-tuning T5.1.
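
As a rough illustration of the dropout advice, here is a minimal sketch in plain Python. `FineTuneConfig`, `reenable_dropout`, and the checkpoint label `"t5.1-base"` are hypothetical names made up for this example and are not part of the T5 codebase or any library. The default of 0.1 is the dropout rate used for the original T5 and is only assumed here as a starting point; the right value and the actual setting name depend on the training framework you use.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class FineTuneConfig:
    """Hypothetical settings container, not part of any T5 library."""
    checkpoint: str
    dropout_rate: float


def reenable_dropout(config: FineTuneConfig, rate: float = 0.1) -> FineTuneConfig:
    """Return a copy of `config` with dropout switched on if it was off."""
    if not 0.0 < rate < 1.0:
        raise ValueError("rate must be strictly between 0 and 1")
    if config.dropout_rate > 0.0:
        return config  # dropout is already active; leave the setting alone
    return replace(config, dropout_rate=rate)


# Assumption: the checkpoint's pre-training config has dropout turned off.
pretrain_cfg = FineTuneConfig(checkpoint="t5.1-base", dropout_rate=0.0)
finetune_cfg = reenable_dropout(pretrain_cfg)
print(finetune_cfg)
```

The point is only to check the effective dropout rate before fine-tuning rather than inheriting the pre-training value; if fine-tuning runs with dropout still at 0, that alone may explain some of the difference in convergence.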

Replies: 1 comment

Answer selected by pablogranolabar
Category
Q&A
2 participants