Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
The N Implementation Details of RLHF with PPO (#1580)
* The N Implementation Details of RLHF with PPO * quick change * fix format * update * quick update * Update the_n_implementation_details_of_rlhf_with_ppo.md Co-authored-by: Pedro Cuenca <[email protected]> * Update the_n_implementation_details_of_rlhf_with_ppo.md Co-authored-by: Pedro Cuenca <[email protected]> * Update the_n_implementation_details_of_rlhf_with_ppo.md Co-authored-by: Pedro Cuenca <[email protected]> * remove * update blog.yml * Update _blog.yml Co-authored-by: Leandro von Werra <[email protected]> --------- Co-authored-by: Pedro Cuenca <[email protected]> Co-authored-by: Leandro von Werra <[email protected]>
- Loading branch information