Added new Post-training an LLM using GRPO with TRL
recipe 🧑🍳️
#710
Job | Run time |
---|---|
34m 46s | |
34m 46s |
Post-training an LLM using GRPO with TRL
recipe 🧑🍳️
#710
Job | Run time |
---|---|
34m 46s | |
34m 46s |