Pull requests: pytorch/rl
[Feature] RayReplayBuffer
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2835
opened Mar 6, 2025 by
vmoens
[Tutorial] LLM integration
CLA Signed
#2832
opened Mar 5, 2025 by
vmoens
[Feature] Macro-actions for LLMs
CLA Signed
#2831
opened Mar 5, 2025 by
vmoens
[Feature] vllm wrapper
CLA Signed
#2830
opened Mar 5, 2025 by
vmoens
Fixed VideoRecorder crash when passing fps
bug
Something isn't working
CLA Signed
#2827
opened Mar 4, 2025 by
AlexandreBrown
3 tasks done
[Feature] transformers policy
CLA Signed
#2825
opened Mar 4, 2025 by
vmoens
[Feature] batch_size, reward, done, attention_key in LLMEnv
CLA Signed
#2824
opened Mar 4, 2025 by
vmoens
[Feature] DensifyReward postproc
CLA Signed
enhancement
New feature or request
#2823
opened Mar 3, 2025 by
vmoens
[Feature] DataLoadingPrimer.repeat
CLA Signed
enhancement
#2822
opened Mar 3, 2025 by
vmoens
[DEBUG] ppo compile
CLA Signed
#2814
opened Feb 27, 2025 by
IvanKobzarev
10 tasks
[Feature,Deprecation] Split KLRewardTransform in more modules
CLA Signed
#2813
opened Feb 27, 2025 by
vmoens
[DRAFT, Example] Add MCTS example
CLA Signed
Examples
#2796
opened Feb 19, 2025 by
kurtamohler
Draft
[DRAFT] ppo chess with llm and ConditionalPolicySwitch to sunfish bot
CLA Signed
#2763
opened Feb 5, 2025 by
mikaylagawarecki
Draft
[Feature] ConditionalPolicySwitch transform
CLA Signed
enhancement
#2711
opened Jan 21, 2025 by
vmoens
[Example] Self-play chess PPO example
CLA Signed
Examples
#2709
opened Jan 21, 2025 by
vmoens
[WIP] Compute lp during loss execution
CLA Signed
#2688
opened Jan 10, 2025 by
vmoens
[CI] Fix conda on windows
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
#2676
opened Dec 20, 2024 by
vmoens
10 tasks
[Tutorial] MCTS
CLA Signed
#2673
opened Dec 19, 2024 by
vmoens
First draft for modular Hindsight Experience Replay Transform
CLA Signed
enhancement
[Tutorial] Beam search with GPT models
CLA Signed
tutorials
#2623
opened Dec 2, 2024 by
vmoens
[Feature] PPOTrainer
CLA Signed
#2550
opened Nov 11, 2024 by
vmoens
[Feature] habitat env from config
bug
CLA Signed
enhancement
#2539
opened Nov 6, 2024 by
vmoens
10 tasks
[CI] Fix windows upload wheels
CI
CLA Signed
#2507
opened Oct 21, 2024 by
vmoens
[Feature] Gymnasium 1.0 compatibility
CLA Signed
Environments
Adds or modifies an environment wrapper
#2473
opened Oct 9, 2024 by
vmoens
[Examples] boiler plate code for multi-turn reward for RLHF
CLA Signed
enhancement
#2467
opened Oct 5, 2024 by
rghosh08
3 of 10 tasks