Skip to content

Releases: araffin/sbx

SBX v0.18.0: Bug fix for SAC, optimize log of ent coeff to be consistent with SB3

01 Nov 07:48
1c79684
Compare
Choose a tag to compare

What's Changed

  • Optimize the log of the entropy coeff instead of the entropy coeff by @jamesheald in #56

New Contributors

Full Changelog: v0.17.0...v0.18.0

SBX v0.17.0: CNN support for DQN

11 Jul 12:07
19c85a1
Compare
Choose a tag to compare

What's Changed

  • Fix warning and remove DroQ class in favor of SAC config by @araffin in #47
  • Add CNN support for DQN by @araffin in #49

Full Changelog: v0.15.0...v0.17.0

SBX v0.15.0: Hotfix for offpolicy algorithms, the pseudo random key was not updated

12 Apr 12:02
42caa65
Compare
Choose a tag to compare

Note

No performance difference should be expected (See report in #46), this bug was introduced in v0.11.0.

What's Changed

  • Support for setting the target entropy by @jan1854 in #43
  • Hotfix - Return the new updated key in function _train by @theovincent in #46

New Contributors

Full Changelog: v0.13.0...v0.15.0

SBX v0.13.0: Added CrossQ algorithm and support for custom activations

03 Apr 10:21
c8db73f
Compare
Choose a tag to compare

Warning

Using DroQ class directly is deprecated and will be removed in SBX v0.14.0.
Please use SAC/TQC/CrossQ directly instead with the DroQ configuration, see https://github.com/araffin/sbx?tab=readme-ov-file#note-about-droq

To upgrade:

pip install sbx-rl --upgrade

CrossQ: https://openreview.net/forum?id=PczQtTsTIX (SAC with batch norm and no target network)

What's Changed

New Contributors

Full Changelog: v0.12.0...v0.13.0

SBX v0.12.0: Added support for MultiDiscrete and MultiBinary action spaces to PPO

28 Feb 21:40
db6120b
Compare
Choose a tag to compare

What's Changed

  • Support for MultiDiscrete and MultiBinary action spaces in PPO by @jan1854 in #30

Full Changelog: v0.11.0...v0.12.0

SBX v0.11.0: Added support for large values for gradient_steps to SAC, TD3, and TQC

09 Feb 08:51
e564074
Compare
Choose a tag to compare

What's Changed

  • Added support for large values for gradient_steps to SAC, TD3, and TQC by @jan1854 in #21

New Contributors

Full Changelog: v0.10.0...v0.11.0

SBX v0.10.0: Fix `train()` signature and update type hints

16 Jan 13:37
37ed771
Compare
Choose a tag to compare

What's Changed

  • Fix train signature and update type hints by @araffin in #24

Full Changelog: v0.9.1...v0.10.0

SBX v0.9.1: Fix replay buffer device at load time

13 Dec 16:02
ba597ca
Compare
Choose a tag to compare

What's Changed

  • Fix replay buffer device at load time by @araffin in #20

This issue was introduced with SB3 v2.2.1.

Full Changelog: v0.9.0...v0.9.1

SBX v0.9.0: Add flatten layer

18 Nov 15:21
9bd4bca
Compare
Choose a tag to compare

What's Changed

  • Add flatten layer and update dependencies by @araffin in #18

Full Changelog: v0.8.0...v0.9.0

SBX v0.8.0: Added DDPG and TD3

07 Sep 09:00
f662613
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.7.0...v0.8.0