-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added Colab TPU support with Colab Notebook and modified repo #47
base: master
Are you sure you want to change the base?
Conversation
Hello, |
@aditya-malte Thanks for your contribution. It would be nice if you could do the following:
|
Hello @kimiyoung, |
Working perfectly for IMDB dataset for max_seq =128 and batch_size 64. Currently testing how far I can push the Colab TPU by increasing max_seq and/or batch_size |
Gives near SOTA results for eval_accuracy(IMDB) result equal to 0.9512. with batch size 32 and max_seq=256(Increasing max_seq further throws OOM error). Training and evaluation combined get completed within an hour or so. |
Sync with source repo
Hello @kimiyoung , Best Regards, |
Hello @kimiyoung ,
Thanks |
Sync(update) with source repo changes
The above merge PR, is in order to update my repo with all the changes made in yours |
Update train.py
I have made a colab notebook to allow for easy use of Google's Colab TPU.
The same has been tried and test successfully for Colab TPU.
(This includes additional things such as downloading weights to gcs buckets but keeping spiece file locally)
Thank you