Skip to content

Pull requests: pytorch/rl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Feature] RayReplayBuffer CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2835 opened Mar 6, 2025 by vmoens Loading…
[Tutorial] LLM integration CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2832 opened Mar 5, 2025 by vmoens Loading…
[Feature] Macro-actions for LLMs CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2831 opened Mar 5, 2025 by vmoens Loading…
[Feature] vllm wrapper CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2830 opened Mar 5, 2025 by vmoens Loading…
Fixed VideoRecorder crash when passing fps bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2827 opened Mar 4, 2025 by AlexandreBrown Loading…
3 tasks done
[Feature] transformers policy CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2825 opened Mar 4, 2025 by vmoens Loading…
[Feature] batch_size, reward, done, attention_key in LLMEnv CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2824 opened Mar 4, 2025 by vmoens Loading…
[Feature] DensifyReward postproc CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2823 opened Mar 3, 2025 by vmoens Loading…
[Feature] DataLoadingPrimer.repeat CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2822 opened Mar 3, 2025 by vmoens Loading…
[DEBUG] ppo compile CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2814 opened Feb 27, 2025 by IvanKobzarev Loading…
10 tasks
[Feature,Deprecation] Split KLRewardTransform in more modules CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2813 opened Feb 27, 2025 by vmoens Loading…
[DRAFT, Example] Add MCTS example CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Examples
#2796 opened Feb 19, 2025 by kurtamohler Draft
[DRAFT] ppo chess with llm and ConditionalPolicySwitch to sunfish bot CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2763 opened Feb 5, 2025 by mikaylagawarecki Draft
[Feature] ConditionalPolicySwitch transform CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2711 opened Jan 21, 2025 by vmoens Loading…
[Example] Self-play chess PPO example CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Examples
#2709 opened Jan 21, 2025 by vmoens Loading…
[WIP] Compute lp during loss execution CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2688 opened Jan 10, 2025 by vmoens Loading…
[CI] Fix conda on windows CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2676 opened Dec 20, 2024 by vmoens Loading…
10 tasks
[Tutorial] MCTS CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2673 opened Dec 19, 2024 by vmoens Loading…
First draft for modular Hindsight Experience Replay Transform CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2667 opened Dec 19, 2024 by dtsaras Draft
3 of 10 tasks
[Tutorial] Beam search with GPT models CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. tutorials
#2623 opened Dec 2, 2024 by vmoens Loading…
[Feature] PPOTrainer CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2550 opened Nov 11, 2024 by vmoens Loading…
[Feature] habitat env from config bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2539 opened Nov 6, 2024 by vmoens Loading…
10 tasks
[CI] Fix windows upload wheels CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2507 opened Oct 21, 2024 by vmoens Loading…
[Feature] Gymnasium 1.0 compatibility CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Environments Adds or modifies an environment wrapper
#2473 opened Oct 9, 2024 by vmoens Loading…
[Examples] boiler plate code for multi-turn reward for RLHF CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2467 opened Oct 5, 2024 by rghosh08 Loading…
3 of 10 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.