About Original Nth Farthest #3

vanzytay · 2018-10-20T11:20:16Z

Hey!

I've been wondering if you have tried the original Nth Farthest code (from Sonnet) on a 16GB Ram GPU. I keep running into memory errors no matter what I do (on a Volta GPU).

Wondering if you have any clue. (Sorry this is not directly related to your repository), just wondering if you got the original Sonnet version to work.

Thanks!

L0SG · 2018-10-20T13:36:31Z

Hi!

Sadly I have not tried to run the official Sonnet code myself, and just ported the core implementation of RMC with the Sonnet code just for the reference. So I'm afraid i could not share any useful pointers.

The Nth farthest task implementation is from the contributor. (Mind if I ask about this topic, @jessicayung ?)

Side note here, I've been running train_nth_farthest on my TITAN Xp about 5 days now, and checking now, it did break the 25% barrier and reached 91%. Haven't logged anything so I'll try to compare the results with the one form the paper soon.

vanzytay · 2018-10-20T15:36:36Z

Hey! Thanks for the quick reply!

Wow, thanks. I'll use your version in my experiments!

vanzytay · 2018-10-22T05:57:00Z

@L0SG Hey there, one more question. I started running your N-Farthest script. It seems to still hover around 0.25 (it's been 1 day). Could you describe if there is a sudden spike in performance (to 91%) or at roughly how many epoch does it take to reach somewhere along that score! And is the default hyperparameters correct for achieving this result? Thanks!

L0SG · 2018-10-22T16:39:58Z

I've fired up the code and let it run forever and actually forgot about it for like 5 days. And checking it after seeing your issue, it was reaching 91% at around ~180000 epochs. The original paper says a wall clock time of breaking the 25% mark at around 40~50 hours, so running the code for at least this time period is a viable choice I suppose. Regarding to the default hyperparameters, I've not checked every last details of them yet, but I believe that the contributor took a great effort for matching them as faithful as possible. Currently I'm doing another project (not related to sequence unfortunately :( ), so I'll double check the faithfulness when I have a spare time. Meanwhile, if you could find the difference btw the Sonnet and this repo, please let me know and I'll fix it. Thanks!

jessicayung · 2018-10-22T16:57:23Z

@vanzytay The hyperparameters in this implementation were set based on the paper first and the official Sonnet implementation second. Not sure if there were differences between the two. Let me know if you find any problems. I spoke with one of the authors and they did say that the RRNN tends to run for a while before having something like an 'aha' moment and having a spike in performance (as shown in the graphs in the paper).

Also really glad to hear that the implementation's broken the 25% barrier, thanks for running it for longer Sang-gil!

vanzytay · 2018-10-22T17:19:37Z

Thanks @L0SG and @jessicayung for your replies!

L0SG · 2018-11-13T08:04:35Z

I've uploaded a bit overdue experimental results of the nth farthest task. Definitely takes way longer than the reported results from the paper. I will play with other hyperparameters when I have spare GPU resources available.

vanzytay · 2018-11-13T08:31:49Z

@L0SG Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About Original Nth Farthest #3

About Original Nth Farthest #3

vanzytay commented Oct 20, 2018

L0SG commented Oct 20, 2018 •

edited

Loading

vanzytay commented Oct 20, 2018

vanzytay commented Oct 22, 2018

L0SG commented Oct 22, 2018 via email •

edited

Loading

jessicayung commented Oct 22, 2018

vanzytay commented Oct 22, 2018

L0SG commented Nov 13, 2018

vanzytay commented Nov 13, 2018

About Original Nth Farthest #3

About Original Nth Farthest #3

Comments

vanzytay commented Oct 20, 2018

L0SG commented Oct 20, 2018 • edited Loading

vanzytay commented Oct 20, 2018

vanzytay commented Oct 22, 2018

L0SG commented Oct 22, 2018 via email • edited Loading

jessicayung commented Oct 22, 2018

vanzytay commented Oct 22, 2018

L0SG commented Nov 13, 2018

vanzytay commented Nov 13, 2018

L0SG commented Oct 20, 2018 •

edited

Loading

L0SG commented Oct 22, 2018 via email •

edited

Loading