
About tying the weights #165

Open
johnny5550822 opened this issue Nov 15, 2016 · 0 comments


@johnny5550822

Hi, I read the blog post https://blog.twitter.com/2015/autograd-for-torch and really like the simplicity of using autograd, thanks! I have a question about it.

In the autoencoder example, the weights of the decoding layers l4-l6 are tied to those of the encoding layers l3-l1:

-- Tie the weights in the decoding layers
l4.weight = l3.weight:t()
l4.gradWeight = l3.gradWeight:t()
l5.weight = l2.weight:t()
l5.gradWeight = l2.gradWeight:t()
l6.weight = l1.weight:t()
l6.gradWeight = l1.gradWeight:t()

Is this handled implicitly by autograd? If so, aren't the tied weights updated twice in one backpropagation pass? The loss is propagated back through the network from l6 to l1, and since, for example, l4 and l3 share the same weights, won't those shared weights receive two updates?
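
To make my question concrete, here is a rough sketch (my own guess, not code from the blog post) of how I imagine the tied autoencoder would be written in autograd's functional style, assuming a single hidden layer and that torch.t is a supported op. Since the decoder reuses W1 directly, I would expect autograd to accumulate both gradient contributions into grads.W1, so the shared weight is updated once per step rather than twice:

local autograd = require 'autograd'
require 'torch'

-- Parameters: only the encoder weight W1 exists; the decoder reuses its transpose.
local inputSize, hiddenSize = 10, 4
local params = {
   W1 = torch.randn(inputSize, hiddenSize) * 0.1,
   b1 = torch.zeros(1, hiddenSize),
   b2 = torch.zeros(1, inputSize),
}

-- Reconstruction loss of a one-hidden-layer autoencoder with tied weights.
local function autoencoderLoss(p, x)
   local h    = torch.tanh(x * p.W1 + p.b1)           -- encode: x is 1 x inputSize
   local xhat = torch.tanh(h * torch.t(p.W1) + p.b2)  -- decode with the transpose of W1
   local err  = xhat - x
   return torch.sum(torch.cmul(err, err))
end

local dLoss = autograd(autoencoderLoss)

local x = torch.randn(1, inputSize)
local grads, loss = dLoss(params, x)
-- grads.W1 should already sum the gradient contributions from both uses of W1,
-- so a single SGD step updates the shared weight once:
params.W1:add(-0.01, grads.W1)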
