
Use dynamic learning rate decay for convergence #39

Open · wants to merge 2 commits into master

Conversation

@begeekmyfriend

The evaluation samples sound better than those trained with a fixed learning rate.

Signed-off-by: begeekmyfriend [email protected]

@seungwonpark (Owner)

Hi, your code looks great, and thanks for kindly sending a PR!
Could you please share the audio samples you got (with the number of epochs) for comparison?

@begeekmyfriend (Author)

melgan_eval_mandarin.zip
I have synthesized voices from 4 anchors (1 male and 3 females). The checkpoint is only at epoch 375 and still under training. I think the decay helps convergence.

Signed-off-by: begeekmyfriend <[email protected]>
@@ -13,18 +12,35 @@
from .validation import validate


def cosine_decay(init_val, final_val, step, decay_steps):
    alpha = final_val / init_val

Shouldn't this be init_val / final_val?

@begeekmyfriend (Author)

The learning rate decays rather than grows. You can write a demo to test it, for example with:

init_val = 1e-4
final_val = 1e-5
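
For instance, a minimal self-contained sketch of such a demo, assuming the omitted part of the function follows TensorFlow's tf.train.cosine_decay formula (the diff above shows only the first line of the body):

```python
import math

def cosine_decay(init_val, final_val, step, decay_steps):
    # alpha is the final LR expressed as a fraction of the initial LR,
    # matching the "alpha" argument of tf.train.cosine_decay.
    alpha = final_val / init_val
    step = min(step, decay_steps)
    cos_factor = 0.5 * (1 + math.cos(math.pi * step / decay_steps))
    decayed = (1 - alpha) * cos_factor + alpha
    return init_val * decayed

init_val = 1e-4
final_val = 1e-5
decay_steps = 1000  # hypothetical horizon, only for this demo

for step in (0, 250, 500, 750, 1000):
    print(step, cosine_decay(init_val, final_val, step, decay_steps))
# prints a smooth decay from 1e-4 at step 0 down to 1e-5 at step 1000
```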

@casper-hansen commented Jan 26, 2020

According to the following source, alpha is the "Minimum learning rate value as a fraction of learning_rate."
https://docs.w3cub.com/tensorflow~python/tf/train/cosine_decay/

Given the values, it looks correct; only the naming is misleading. The smallest value belongs in the numerator and the largest in the denominator.
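
Plugging in the boundary steps confirms it: with decayed = (1 - alpha) * cos_factor + alpha, step = 0 gives cos_factor = 1 and lr = init_val = 1e-4, while step = decay_steps gives cos_factor = 0 and lr = alpha * init_val = final_val = 1e-5.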

@bob80333

Is this different from PyTorch's built-in CosineAnnealingLR?
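
For reference, PyTorch's built-in scheduler produces the same half-cosine curve when used without restarts. A minimal sketch (not part of this PR; the Adam optimizer and the decay_steps value here are illustrative assumptions):

```python
import torch
from torch.optim.lr_scheduler import CosineAnnealingLR

model = torch.nn.Linear(1, 1)  # stand-in for the generator
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# T_max plays the role of decay_steps, eta_min the role of final_val.
decay_steps = 1000  # hypothetical horizon
scheduler = CosineAnnealingLR(optimizer, T_max=decay_steps, eta_min=1e-5)

for step in range(decay_steps):
    optimizer.step()   # the real training step would go here
    scheduler.step()   # lr follows eta_min + (lr0 - eta_min) * (1 + cos(pi * t / T_max)) / 2
```

That closed form matches the hand-rolled cosine_decay above, so the two should behave identically over a single decay cycle.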

@Liujingxiu23

@begeekmyfriend
You tried different learning rate schedules and found that cosine decay works best? Why is it suitable for MelGAN?
I am confused about when we should use a constant learning rate, when a declining one (for example, the exponential decay in Tacotron), and when cosine decay.

@begeekmyfriend (Author)

It is just a preference. Pick this one or another schedule as you like.

@Liujingxiu23 commented Jan 28, 2021

@begeekmyfriend Thank you for your quick reply. I used your branch of Tacotron and found it is one of the best among many code branches. I will try the cosine learning rate as well as apex in tfgan.
