Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: adds REINFORCE algorithm #357

Open
wants to merge 11 commits into
base: main
Choose a base branch
from

Commits on Oct 25, 2024

  1. Init reinforce

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    08c3a4f View commit details
    Browse the repository at this point in the history
  2. Remove ratio eps from actor

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    9e0f9cf View commit details
    Browse the repository at this point in the history
  3. Remove ratio eps from actor

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    6e5d0fc View commit details
    Browse the repository at this point in the history
  4. Fix infer rm call

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    0f31586 View commit details
    Browse the repository at this point in the history
  5. Debug reward shape

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    7a37f00 View commit details
    Browse the repository at this point in the history
  6. Debug reward shape

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    f873ce3 View commit details
    Browse the repository at this point in the history
  7. Debug reward shape

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    063aed5 View commit details
    Browse the repository at this point in the history
  8. Add REINFORCE documentation

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    7bb859f View commit details
    Browse the repository at this point in the history
  9. Update CHANGELOG

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    0f61e18 View commit details
    Browse the repository at this point in the history
  10. [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci
    
    Signed-off-by: Alexander Bukharin <[email protected]>
    pre-commit-ci[bot] authored and Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    8a1cf44 View commit details
    Browse the repository at this point in the history
  11. Update REINFORCE example

    Signed-off-by: Alexander Bukharin <[email protected]>
    Alexander Bukharin committed Oct 25, 2024
    Configuration menu
    Copy the full SHA
    e851b6d View commit details
    Browse the repository at this point in the history