fix(pu): fix noise layer's usage based on the original paper #866

puyuan1996 · 2025-04-17T07:28:01Z

Description

This pull request fixes the usage of Noisy Net in accordance with the original Noisy Net paper.

The key modifications are as follows:

Add set_noise_mode Function:
A new helper function, set_noise_mode, is introduced to control whether the noise is enabled (enable_noise). This function is used to update noise settings in the network.
Add _reset_noise Method in DQN:
A new _reset_noise method has been added to the DQN implementation. During each training step, the noise is reset and the corresponding noise is applied.
Model Weight and Noise Settings:
- The training model, collection model, and evaluation model share the same weights.
- During each training step, the model resets the noise and applies new noise. Once training steps conclude and the collection step begins, the noise added is the same as the one from the last training step.
- For the evaluation model, no noise is applied.
Experimental Result:
After fixing the Noisy Net implementation to be consistent with the paper's description, experimental results indicate that there is no significant performance difference whether Noisy Net is used or not.

Related Issue

[Issue #850](Noisy Net Issue #850)

Check List

Merge the latest version of the source branch/repo and resolve all conflicts
Pass style check
Pass all tests

ding/model/template/q_learning.py

ding/policy/dqn.py

PaParaZz1 · 2025-04-18T07:00:12Z

ding/policy/dqn.py

@@ -248,6 +248,8 @@ def _forward_learn(self, data: List[Dict[str, Any]]) -> Dict[str, Any]:
        .. note::
            For more detailed examples, please refer to our unittest for DQNPolicy: ``ding.policy.tests.test_dqn``.
        """
+        set_noise_mode(self._learn_model, True)


use noisy_net to control this line

Another question: how to deal with target_model in noisy net

ding/policy/common_utils.py

ding/torch_utils/network/nn_module.py

ding/policy/dqn.py

PaParaZz1 · 2025-06-03T13:56:25Z

ding/policy/rainbow.py

@@ -201,6 +202,11 @@ def _forward_learn(self, data: dict) -> Dict[str, Any]:
        # ====================
        self._learn_model.train()
        self._target_model.train()
+
+        # Set noise mode for NoisyNet for exploration in learning if enabled in config
+        set_noise_mode(self._learn_model, True)


why not use self._cfg.noisy_net to control this logic

puyuan added 2 commits April 17, 2025 07:18

fix(pu): fix noise layer's usage

5a01fde

polish(pu): polish comments

454334c

puyuan1996 added the bug Something isn't working label Apr 17, 2025

PaParaZz1 requested changes Apr 18, 2025

View reviewed changes

puyuan1996 added 4 commits May 29, 2025 11:07

polish(pu): polish noisy_net config

ee07a99

fix(pu): fix reset_noise bug in noisy_net option

41c810d

fix(pu): fix enable_noise bug in rainbow

681488c

style(pu): yapf format

688bfd7

puyuan1996 changed the title ~~fix(pu): fix noise layer's usage~~ fix(pu): fix noise layer's usage based on the original paper Jun 3, 2025

style(pu): yapf format

ad1a2a9

puyuan1996 mentioned this pull request Jun 3, 2025

Noisy Net Issue #850

Open

puyuan1996 added 2 commits June 3, 2025 12:24

style(pu): flake8 format

3133049

style(pu): yapf format

83b5fbe

PaParaZz1 approved these changes Jun 3, 2025

View reviewed changes

PaParaZz1 mentioned this pull request Jun 3, 2025

Roadmap for DI-engine #548

Open

puyuan1996 added 2 commits June 3, 2025 22:08

polish(pu): polish set_noise_mode when self._cfg.noisy_net is False

a6be7d4

fature(pu): add unittest for noise_linear_layer

44588d0

PaParaZz1 merged commit cf72cc0 into main Jun 3, 2025
19 of 33 checks passed

PaParaZz1 deleted the fix-noise branch June 3, 2025 14:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(pu): fix noise layer's usage based on the original paper #866

fix(pu): fix noise layer's usage based on the original paper #866

Uh oh!

puyuan1996 commented Apr 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

PaParaZz1 Apr 18, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

PaParaZz1 Jun 3, 2025

Uh oh!

Uh oh!

Uh oh!

fix(pu): fix noise layer's usage based on the original paper #866

fix(pu): fix noise layer's usage based on the original paper #866

Uh oh!

Conversation

puyuan1996 commented Apr 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue

Check List

Uh oh!

Uh oh!

Uh oh!

Uh oh!

PaParaZz1 Apr 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

PaParaZz1 Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

puyuan1996 commented Apr 17, 2025 •

edited

Loading