The N Implementation Details of RLHF with PPO (#1580)
* The N Implementation Details of RLHF with PPO

* quick change

* fix format

* update

* quick update

* Update the_n_implementation_details_of_rlhf_with_ppo.md

Co-authored-by: Pedro Cuenca <[email protected]>

* Update the_n_implementation_details_of_rlhf_with_ppo.md

Co-authored-by: Pedro Cuenca <[email protected]>

* Update the_n_implementation_details_of_rlhf_with_ppo.md

Co-authored-by: Pedro Cuenca <[email protected]>

* remove

* update blog.yml

* Update _blog.yml

Co-authored-by: Leandro von Werra <[email protected]>

---------

Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Leandro von Werra <[email protected]>
3 people authored Oct 24, 2023
1 parent da04037 commit 25bf01e
Showing 3 changed files with 546 additions and 0 deletions.
10 changes: 10 additions & 0 deletions _blog.yml
@@ -2979,3 +2979,13 @@
    - gradio
    - open-source
    - serverless

- local: the_n_implementation_details_of_rlhf_with_ppo
  title: "The N Implementation Details of RLHF with PPO"
  author: vwxyzjn
  thumbnail: /blog/assets/167_the_n_implementation_details_of_rlhf_with_ppo/thumbnail.png
  date: October 24, 2023
  tags:
    - research
    - rl
    - rlhf