Added new Post-training an LLM using GRPO with TRL recipe 🧑‍🍳
#697
Job | Run time |
---|---|
34m 45s | |
34m 45s |