Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OLMO + RL #424

Draft
wants to merge 48 commits into
base: olmo_again
Choose a base branch
from
Draft

OLMO + RL #424

wants to merge 48 commits into from

Commits on Oct 1, 2024

  1. first pass

    hamishivi committed Oct 1, 2024
    Configuration menu
    Copy the full SHA
    b444e80 View commit details
    Browse the repository at this point in the history

Commits on Oct 2, 2024

  1. fix spelling, ground truth stuff

    Hamish Ivison committed Oct 2, 2024
    Configuration menu
    Copy the full SHA
    f0569a3 View commit details
    Browse the repository at this point in the history
  2. fix misspelling

    Hamish Ivison committed Oct 2, 2024
    Configuration menu
    Copy the full SHA
    8e0f517 View commit details
    Browse the repository at this point in the history

Commits on Oct 4, 2024

  1. count verifieds and intermediate saving

    Hamish Ivison committed Oct 4, 2024
    Configuration menu
    Copy the full SHA
    6eebf7f View commit details
    Browse the repository at this point in the history

Commits on Oct 7, 2024

  1. save intermediate steps

    Hamish Ivison committed Oct 7, 2024
    Configuration menu
    Copy the full SHA
    f9a0b3c View commit details
    Browse the repository at this point in the history

Commits on Oct 8, 2024

  1. small fix to logging

    Hamish Ivison committed Oct 8, 2024
    Configuration menu
    Copy the full SHA
    bad1933 View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2024

  1. fix bug for forward rollout batching

    Hamish Ivison committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    028315d View commit details
    Browse the repository at this point in the history

Commits on Oct 12, 2024

  1. support gsm8k and math, more flexibility in future

    Hamish Ivison committed Oct 12, 2024
    Configuration menu
    Copy the full SHA
    faa7dc0 View commit details
    Browse the repository at this point in the history

Commits on Oct 16, 2024

  1. add costas plo thing

    Hamish Ivison committed Oct 16, 2024
    Configuration menu
    Copy the full SHA
    5970243 View commit details
    Browse the repository at this point in the history

Commits on Oct 21, 2024

  1. add numina math

    Hamish Ivison committed Oct 21, 2024
    Configuration menu
    Copy the full SHA
    f8fb8eb View commit details
    Browse the repository at this point in the history

Commits on Oct 23, 2024

  1. remove plo, add value model rand init, first stab at rephrase model l…

    …oading
    Hamish Ivison committed Oct 23, 2024
    Configuration menu
    Copy the full SHA
    eda4849 View commit details
    Browse the repository at this point in the history
  2. math strict verify

    Hamish Ivison committed Oct 23, 2024
    Configuration menu
    Copy the full SHA
    b709f37 View commit details
    Browse the repository at this point in the history
  3. Merge branch 'main' into verifiable-rewards

    Hamish Ivison committed Oct 23, 2024
    Configuration menu
    Copy the full SHA
    a328946 View commit details
    Browse the repository at this point in the history

Commits on Oct 24, 2024

  1. ifeval code

    Hamish Ivison committed Oct 24, 2024
    Configuration menu
    Copy the full SHA
    51f0b2a View commit details
    Browse the repository at this point in the history
  2. ifeval debug

    Hamish Ivison committed Oct 24, 2024
    Configuration menu
    Copy the full SHA
    79ec960 View commit details
    Browse the repository at this point in the history
  3. incorporate val fixes

    Hamish Ivison committed Oct 24, 2024
    Configuration menu
    Copy the full SHA
    b1b47bf View commit details
    Browse the repository at this point in the history
  4. data fixed, remove skips

    Hamish Ivison committed Oct 24, 2024
    Configuration menu
    Copy the full SHA
    d61038e View commit details
    Browse the repository at this point in the history

Commits on Oct 29, 2024

  1. Prototype ppo + ray (#390)

    * Prototype ppo + ray
    
    * reduce gradient
    
    * push
    
    * push changes
    
    * quick push
    
    * cache changes; this actually works with 6 nodes
    
    * push changes
    
    * psuh changes
    
    * push the latest change
    
    * push changes
    
    * Fix uploading
    
    * Make style
    
    * style and quality
    
    * update docs
    
    * update mason.py
    
    * log wandb tables
    
    * update docs
    
    * make style quality
    
    * make sure to save the right thing
    
    * push changes
    
    * push
    
    * push
    
    * push changes
    
    * push
    
    * push
    
    * fix
    
    * remove preemption code
    
    * fix
    
    * push changes
    
    * push
    
    * quick fix
    
    * quick push
    vwxyzjn authored Oct 29, 2024
    Configuration menu
    Copy the full SHA
    77f619d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b9de634 View commit details
    Browse the repository at this point in the history

Commits on Oct 30, 2024

  1. add weka save override

    hamishivi committed Oct 30, 2024
    Configuration menu
    Copy the full SHA
    f6a2b75 View commit details
    Browse the repository at this point in the history
  2. add multinode ray file

    hamishivi committed Oct 30, 2024
    Configuration menu
    Copy the full SHA
    b59659a View commit details
    Browse the repository at this point in the history

Commits on Oct 31, 2024

  1. lint and fix

    hamishivi committed Oct 31, 2024
    Configuration menu
    Copy the full SHA
    36a2ed4 View commit details
    Browse the repository at this point in the history

Commits on Nov 1, 2024

  1. first stab at flan

    hamishivi committed Nov 1, 2024
    Configuration menu
    Copy the full SHA
    527c51f View commit details
    Browse the repository at this point in the history

Commits on Nov 5, 2024

  1. Configuration menu
    Copy the full SHA
    f037460 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ed18615 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    bdc3fa6 View commit details
    Browse the repository at this point in the history
  4. quick change

    vwxyzjn committed Nov 5, 2024
    Configuration menu
    Copy the full SHA
    63a4449 View commit details
    Browse the repository at this point in the history

Commits on Nov 7, 2024

  1. Add ability to use alternate image for safety eval (#422)

    * add ability to use alternate image
    
    * rever
    hamishivi authored Nov 7, 2024
    Configuration menu
    Copy the full SHA
    2bc1772 View commit details
    Browse the repository at this point in the history
  2. Adding final nc configs for v3.9 (#416)

    * final configs
    
    * update
    jacob-morrison authored Nov 7, 2024
    Configuration menu
    Copy the full SHA
    de33290 View commit details
    Browse the repository at this point in the history

Commits on Nov 8, 2024

  1. Configuration menu
    Copy the full SHA
    3cfc9e2 View commit details
    Browse the repository at this point in the history
  2. update OLMo code

    vwxyzjn committed Nov 8, 2024
    Configuration menu
    Copy the full SHA
    ebdf456 View commit details
    Browse the repository at this point in the history

Commits on Nov 11, 2024

  1. Unseen evals n the oe-eval.sh script (#425)

    * unseen evals
    
    * default value
    
    * conditional
    
    * flag for unseen evals
    pdasigi authored Nov 11, 2024
    Configuration menu
    Copy the full SHA
    ef2f3ec View commit details
    Browse the repository at this point in the history
  2. Minor additions to the decontamination script (#421)

    * safety evals and parquet files
    
    * import Dataset
    pdasigi authored Nov 11, 2024
    Configuration menu
    Copy the full SHA
    404d933 View commit details
    Browse the repository at this point in the history
  3. update data dist plots (#410)

    * update data dist plots
    
    * nit
    
    * smooth operation
    
    * updates
    
    * nits
    
    * clean git
    
    * cleaning for final SFT version
    natolambert authored Nov 11, 2024
    Configuration menu
    Copy the full SHA
    863b808 View commit details
    Browse the repository at this point in the history
  4. push changes

    vwxyzjn committed Nov 11, 2024
    Configuration menu
    Copy the full SHA
    3422229 View commit details
    Browse the repository at this point in the history
  5. Ability to set oe-eval priority (#423)

    * oe eval priority
    
    * up
    
    ---------
    
    Co-authored-by: Nathan Lambert <[email protected]>
    hamishivi and natolambert authored Nov 11, 2024
    Configuration menu
    Copy the full SHA
    1b44d61 View commit details
    Browse the repository at this point in the history
  6. Last fix for unseen evals. (#426)

    * fix and add script
    
    * update
    
    * fix copilot typo
    natolambert authored Nov 11, 2024
    Configuration menu
    Copy the full SHA
    8de53e6 View commit details
    Browse the repository at this point in the history
  7. More systematic and reproducible conversion of SFT datasets (#387)

    * Reorganize the data preparation scripts for tulu v1 and v2.
    
    * Minor improvement
    
    * Remove open_platypus_commercial subset from Daring-Anteater
    
    * Use hard-coded examples repo.
    
    * Fix some bugs.
    
    * Add OpenMathInstruct.
    
    * Add a few more v3.5.x SFT mix ablations for the cleaner datasets.
    
    * More experiments on mixes.
    
    * help merge
    
    * prep for merge
    
    * reapply changes
    
    * fix naming
    
    ---------
    
    Co-authored-by: Nathan Lambert <[email protected]>
    yizhongw and natolambert authored Nov 11, 2024
    Configuration menu
    Copy the full SHA
    f43d69a View commit details
    Browse the repository at this point in the history

Commits on Nov 12, 2024

  1. Use vllm for MMLU Pro (#428)

    * Use vllm for all evaluations
    
    * Do not use VLLM only for MMLU and TruthfulQA
    pdasigi authored Nov 12, 2024
    Configuration menu
    Copy the full SHA
    dd16008 View commit details
    Browse the repository at this point in the history

Commits on Nov 13, 2024

  1. Olmo1124ForCausalLM config. (#432)

    * Quick change
    
    * weight converter
    vwxyzjn authored Nov 13, 2024
    Configuration menu
    Copy the full SHA
    b17443e View commit details
    Browse the repository at this point in the history
  2. Support weka evaluation oe eval (#435)

    * Support weka eval
    
    * quick fix
    vwxyzjn authored Nov 13, 2024
    Configuration menu
    Copy the full SHA
    fe2817d View commit details
    Browse the repository at this point in the history
  3. Olmo1124converter (#434)

    * Quick change
    
    * weight converter
    
    * Add olmo1124 converter
    vwxyzjn authored Nov 13, 2024
    Configuration menu
    Copy the full SHA
    7fcbcfa View commit details
    Browse the repository at this point in the history

Commits on Nov 14, 2024

  1. mmlu cot added (#429)

    nouhadziri authored Nov 14, 2024
    Configuration menu
    Copy the full SHA
    db4c0a1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    27a9b9d View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d8bc8dc View commit details
    Browse the repository at this point in the history

Commits on Nov 15, 2024

  1. Configuration menu
    Copy the full SHA
    90b821c View commit details
    Browse the repository at this point in the history
  2. push changes

    vwxyzjn committed Nov 15, 2024
    Configuration menu
    Copy the full SHA
    7905e63 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    918b701 View commit details
    Browse the repository at this point in the history