E.g. Lingvo has this, and they say it's important for bigger models or smaller datasets.

We also have it in RETURNN as the `param_variational_noise` option for a layer. However, here it makes sense to reimplement it.

Related: Weight dropout (#100), weight norm (#91)

The implementation in RETURNN:

```python
# Only apply this if we get a variable. Otherwise, maybe variational noise was already applied
# (by some parent var scope), and we don't want to apply it twice.
if param_variational_noise and param.dtype.is_floating and isinstance(param, tf.Variable):
  with default_control_flow_ctx():  # make independent from loop/cond
    with reuse_name_scope_of_tensor(param, postfix="_variational_noise", add_tensor_name=True):
      param = self.network.cond_on_train(
        fn_train=lambda: param + tf_compat.v1.random_normal(
          tf.shape(param), dtype=param.dtype.base_dtype,
          stddev=param_variational_noise,
          seed=self.network.random.randint(2**31)),
        fn_eval=lambda: param)
```
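For context, the existing option is set per layer in a RETURNN network config. A minimal sketch of such a config follows; the layer names, sizes, and the noise stddev are made-up values, just to show where the option goes:

```python
network = {
    "enc0": {
        "class": "linear", "activation": "tanh", "n_out": 512, "from": "data",
        # stddev of the Gaussian noise added to this layer's params during training
        "param_variational_noise": 0.075,
    },
    "output": {"class": "softmax", "loss": "ce", "from": "enc0"},
}
```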
In Lingvo:
https://github.com/tensorflow/lingvo/blob/65699192ba14521f330a1cae16141433453f1cbf/lingvo/tasks/asr/params/librispeech.py#L139
https://github.com/tensorflow/lingvo/blob/65699192ba14521f330a1cae16141433453f1cbf/lingvo/core/py_utils.py#L3691
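For the reimplementation, the core mechanics are small. Below is a minimal self-contained sketch in plain TF2, independent of RETURNN's helpers (`cond_on_train`, `default_control_flow_ctx`, etc.); the `NoisyDense` name and the default stddev are made up for illustration. It also does not reproduce the loop/cond handling of the RETURNN code above, which samples the noise outside any rec loop so that one noise sample is shared across all time steps of a sequence:

```python
import tensorflow as tf


class NoisyDense(tf.keras.layers.Layer):
  """Dense layer whose kernel gets additive Gaussian noise at training time only."""

  def __init__(self, units, weight_noise_stddev=0.075, **kwargs):
    super().__init__(**kwargs)
    self.units = units
    self.weight_noise_stddev = weight_noise_stddev

  def build(self, input_shape):
    in_dim = int(input_shape[-1])
    self.kernel = self.add_weight(name="kernel", shape=(in_dim, self.units))
    self.bias = self.add_weight(name="bias", shape=(self.units,), initializer="zeros")

  def call(self, inputs, training=False):
    kernel = self.kernel
    if training and self.weight_noise_stddev:
      # Fresh noise per step, added only in the forward computation,
      # so the stored variable (and thus checkpoints/eval) stays clean.
      kernel = kernel + tf.random.normal(
          tf.shape(kernel), stddev=self.weight_noise_stddev, dtype=kernel.dtype)
    return tf.matmul(inputs, kernel) + self.bias
```

E.g. `NoisyDense(128)(tf.zeros([4, 16]), training=True)` gives a noisy forward pass, while `training=False` uses the clean kernel.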